Juan F. Sequeda is the Principal Scientist at data.world. He joined through the acquisition of Capsenta, a company he founded as a spin-off from his research. He holds a PhD in Computer Science from The University of Texas at Austin.
Juan is the recipient of the NSF Graduate Research Fellowship, received 2nd Place in the 2013 Semantic Web Challenge for his work on ConstituteProject.org, Best Student Research Paper at the 2014 International Semantic Web Conference and the 2015 Best Transfer and Innovation Project awarded by the Institute for Applied Informatics. Juan is on the Editorial Board of the Journal of Web Semantics, member of multiple program committees (ISWC, ESWC, WWW, AAAI, IJCAI). He was the General Chair of AMW2018, PC chair of ISWC 2017 In-Use track, co-creator of COLD workshop (7 years co-located at ISWC). He has served as a bridge between academia and industry as the current chair of the Property Graph Schema Working Group, member of the Graph Query Languages task force of the Linked Data Benchmark Council (LDBC) and past invited expert member and standards editor at the World Wide Web Consortium (W3C).
Wearing his scientific hat, Juan's goal is to reliably create knowledge from inscrutable data. His research interests are on the intersection of Logic and Data for (ontology-based) data integration and semantic/graph data management, and what now is called Knowledge Graphs.
Wearing his business hat, Juan is a product manager, does business development and strategy, technical sales and works with customers to understand their problems to translated back to R&D.
Knowledge Graphs, Semantic Web, Databases, (Ontology-Based) Data Integration, Semantic and Graph Data Management
You can usually find me in Austin, Texas or on a plane.
Work: juan [at] data.world
Personal: juanfederico [at] gmail.com
A common frustration we’ve encountered is the lack of adequate tooling around ontology and knowledge graph schema design. Many tools exist– some are overly complex, some are very expensive, and none allow one to work visually, collaboratively and in real time on a document with multiple concurrent users. That is why we took half a dozen Capsenta engineers and spent over two years to design and develop a solution.
The result is Gra.fo, a visual, collaborative, and real-time ontology and knowledge graph schema editor. For more info check out the launch blog post.
Industry graph database implement query languages are not composable: graphs in and NOT graphs out. Without a composable query language, graph databases lack key features such as subqueries and views. Imagine if we could design a graph query language from scratch. Furthermore imagine we could create a group of really smart people coming from academia and industry, from theory and systems and work together for over 2 years in order to help shape the future of graph query languages.
We argue that existing graph database management systems should consider supporting two key characteristics. First, it should be composable, meaning, that graphs are the input and the output of queries. Second, the graph query language should treat paths as first-class citizens. Our result is G-CORE, a powerful graph that makes these goals, and strikes a careful balance between expressivity and evaluation complexity.
More info: http://g-core.org/
Capsenta is an early-stage enterprise software startup, founded by my PhD advisor Daniel Miranker and myself. The company spun out from the Department of Computer Science at the University of Texas at Austin in 2015. Capsenta is commercializing Ultrawrap, the technology developed during my PhD research. Ultrawrap virtualizes relational databases as semantic web data sources (RDF, OWL, SPARQL) which has enabled us to develop a semantic data virtualization platform used for data integration. At Capsenta, we are changing the way enterprises models, governs and integrates data.
I am lucky to have the opportunity to work with our customers and identify socio-technical research challenges in enterprise data integration which we will continue to investigate.
More info: https://capsenta.com/
The Semantic Web’s promise to achieve web-wide data integration requires the inclusion of legacy relational data as RDF, which, in turn, requires the execution of SPARQL queries on the legacy relational database. In this research, we explore a hypothesis: existing commercial relational databases already subsume the algorithms and optimizations needed to support effective SPARQL execution on existing relationally stored data .
Ultrawrap is a Relational Database to RDF (RDB2RDF) system, which encodes a logical representation of the database as a graph using SQL views and a simple syntactic translation of SPARQL queries to SQL queries on those views. Thus, in the course executing a SPARQL query, the SQL optimizer both instantiates a mapping of relational data to RDF and optimizes its execution. Other approaches, such as D2RQ, typically implement aspects of query optimization and execution outside the SQL environment.
Ultrawrap has been evaluated with the Barton and Berlin SPARQL Benchmark across the three major relational database management systems. We identify two important optimizations: detection of unsatisfiable conditions and self-join elimination, such that, when applied, SPARQL queries execute at nearly the same speed as semantically equivalent native SQL queries.
Ultrawrap is currently being commercialized by Capsenta. Please contact ultrawrap@capsenta.com for inquiries.
More info: https://capsenta.com/
Constitutional design and redesign is constant. Over the last 200 years, countries have replaced their constitutions an average of every 19 years and some have amended them almost yearly. A basic problem in the drafting of these documents is the search and analysis of model text deployed in other jurisdictions. Traditionally, this process has been ad hoc and the results suboptimal. As a result, drafters generally lack systematic information about the institutional options and choices available to them. In order to address this informational need, we have developed a web application, Constitute, with the use of semantic technologies. Constitute provides searchable access to the world’s constitutions using the conceptualization, texts, and data developed by the Comparative Constitutions Project. An OWL ontology represents 330 “topics” –e.g. right to health– with which we have tagged relevant provisions of nearly all constitutions in force. The tagged texts were then converted to an RDF representation using R2RML mappings and Capsenta’s Ultrawrap. The portal implements semantic search features to allow constitutional drafters to read, search, and compare the world’s constitutions. The goal of the project is to improve the efficiency and systemization of constitutional design and, thus, to support the independence and self-reliance of constitutional drafters.
More info: https://www.constituteproject.org/