Abstract Conceptual data models describe information systems without the burden of implementation details, and are increasingly used to generate code. They could also be analyzed for consistency and to generate test data except that the expressive constraints supported by popular modeling notations make such analysis intractable. In an earlier empirical study of conceptual models created at LogicBlox Inc., Smaragdakis, Csallner, and Subramanian found that a restricted subset of ORM, called ORM-, includes the vast majority of constraints used in practice and, moreover, allows scalable analysis. After that study, however, LogicBlox Inc. obtained a new ORM modeling tool, which supports discovery and specification of more complex constraints than the previous tool. We report findings of a follow-up study of models constructed using the more powerful tool. Our study finds that LogicBlox developers increasingly rely on a small number of features not in the ORM- subset. We extend ORM- with support for two of them: objectification and a restricted class of external uniqueness constraints. The extensions significantly improve our ability to analyze the ORM models created by developers using the new tool. We also show that a recent change to ORM has rendered the original ORM- algorithms unsound, in general; but that an efficient test suffices to show that these algorithms are in fact sound for the ORM-constraints appearing in any of the models currently in use at LogicBlox.
Scalable Analysis of Conceptual Data Models
-
Algebraic Structures for Capturing the Provenance of SPARQL Queries
Shan Shan Huang, , Publications, 0
Congratulations to LogicBlox team member Grigoris Karvounarakis on his recent publication in ICDT 2013: 16thInternational Conference on Database Theory....
-
Taming the Wildcards: Combining Definition- and Use-Site Variance
Shan Shan Huang, , Publications, 0
While the focus of our language work at LogicBlox is on LogiQL, a declarative query language, we do, together...
-
Exception Analysis and Points-to Analysis: Better Together
Shan Shan Huang, , Publications, 0
In this ISSTA 2009 publication, LogicBlox team member Martin Bravenboer and academic collaborator Yannis Smaragdakis write about how Doop,...
-
Win-move is Coordination Free… Sometimes
Shan Shan Huang, , Publications, 0
Coordination barriers are a major source of inefficiency in distributed query evaluation. Identifying queries that can be evaluated in...
-
LogicBlox, Platform and Language: a Tutorial
Shan Shan Huang, , Publications, 0
Datalog is the mathematical foundation to LogiQL, the LogicBlox query language. We here take a keen interest in fostering...
-
Simulation of Database-valued Markov Chains Using SimSQL
Shan Shan Huang, , Publications, 0
Congratulations to LogicBlox team member Zografoula Vagena on her recent publication in SIGMOD 2013, the premier international conference in...
-
ERBlox: Combining Matching Dependencies with Machine Learning for Entity Resolution
Shan Shan Huang, , Publications, 0
Entity resolution — determining multiple distinct pieces of data as identifiers for the same real-world object — is a...
-
SecureBlox: Customizable Secure Distributed Data Processing
Shan Shan Huang, , Publications, 0
The application of Datalog to the area of security, both in analysis and in the specification of rules, has...