CAS-Based Approach for Automatic Data Integration


Research of automatic integration of structured and semi-structured data has not resulted in success over the past fifty years. No theory of data integration exists. It is unknown what the theoretical necessary requirements are, to fully support automatic data integration from autonomous heterogeneous data sources. Therefore, it is not possible to objectively evaluate if and how much new algorithms, techniques, and specifically Data Definition Languages, move towards meeting such theoretical requirements. To overcome the serious reverse salient the field and industry are in, it will be helpful if a data integration theory would be developed. This article proposes a new look at data integration by using complex adaptive systems principles to analyze current shortcomings and propose a direction that may lead to a data integration theory.

Share and Cite:

E. Rohn, "CAS-Based Approach for Automatic Data Integration," American Journal of Operations Research, Vol. 3 No. 1A, 2013, pp. 181-186. doi: 10.4236/ajor.2013.31A017.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] R. E. Knox, “The XML Family of Standards: Four Years Later,” Gartner Group, Stamford, 2001.
[2] R. E. Knox and C. Abrams, “Hype Cycle for XML Technologies for 2003,” Gartner Group, Stamford, 2003.
[3] R. E. Knox, C. Abrams, T. Friedman, D. Feinberg, K. Harris-Ferrante and D. Logan, “Hype Cycle for XML Technologies, 2006,” Gartner Group, Stamford, 2006.
[4] C. Hsu, “The Metadatabase Project at Rensselaer,” ACM SIGMOD Record, Vol. 20, No. 4, 1991, pp. 83-90. doi:10.1145/141356.141394
[5] C. Shu, “Metadatabase: An Information Integration Theory and Reference Model,” 2003.
[6] J. Clark and S. DeRose, “XML Path Language (XPath) Version 1.0,” W3C Recommendation, 1999.
[7] M. Fernandez, D. Florescu, J. Kang, A. Levy and D. Suciu, “STRUDEL: A Web Site Management System,” Proceedings of the 1997 ACM SIGMOD international conference on Management of data, Tucson, 13-15 May 1997, pp. 549-552.
[8] D. Draper, A. Halevy and D. S. Weld, “The Nimble XML Data Integration System,” Nimble Technology, Inc., San Jose, 2001.
[9] T. Lahiri, S. Abiteboul and J. Widom, “Ozone: Integrating Structured and Semistructured Data,” Revised Papers from the 7th International Workshop on Database Programming Languages: Research Issues in Structured and Semistructured Database Programming, Springer-Verlag, Heidelberg, 2000.
[10] S. Bechhofer, I. Horrocks, C. Goble and R. Stevens, “OilEd: A Reason-able Ontology Editor for the Semantic Web,” In: C. A. Goble, D. L. McGuinness, R. Moller and P. F. Patel-Schneider, Eds., Working Notes of the 2001 International Description Logics Workshop (DL-2001),, Stanford, 2001, pp.
[11] WEBONT, “W3C DAML+OIL Project,” W3C Web-Ontology Working Group, 2009.
[12], “DAML Ontology Library,” US Government (DARPA), Arlington, 2004.
[13] M. Greaves, “2004 DAML Program Directions,” DAML. org, Arlington, 2004.
[14] S. Dustdar, R. Pichler, V. Savenkov and H.-L. Truong, “Quality-Aware Service-Oriented Data Integration: Requirements, State of the Art and Open Challenges,” SIGMOD Record, Vol. 41, No. 1, 2012, pp. 11-19. doi:10.1145/2206869.2206873
[15] Y. Peng, Y. Zhang, Y. Tang and S. Li, “An Incident Information Management Framework Based on Data Integration, Data Mining, and Multi-Criteria Decision Making,” Decision Support Systems, 51 (2011) 316-327. doi:10.1016/j.dss.2010.11.025
[16] R. W. Ashby, “An Introduction to Cybernetics,” Chapman & Hall, London, 1956.
[17] Y. Bar-Yam, “Dynamics of Complex Systems,” Westview Press, Boulder, 1997.
[18] W. Buckley, “Society—A Complex Adaptive System,” Gordon and Breach Publishers, Amsterdam, 1998.
[19] N. Wiener, “Cybernetics or Control and Communication in the Animal and the Machine,” MIT Press, Cambridge, 1948.
[20] W. Buckley, “Sociology and Modern Systems Theory,” Prentice-Hall, Inc., Englewood Cliffs, 1967.
[21] R. C. Raymond, “Communications, Entropy, and Life,” American Scientist, Vol. 38, No. 4, 1950, pp. 273-278.
[22] C. E. Shannon, “A Mathematical Theory of Communication,” Bell Systems Technical Journal, Vol. 27, No. 3, 1948, 379-423.
[23] R. W. Ashby, “Adaptiveness and Equilibrium,” Journal of Mental Science, Vol. 86, No. 5, 1940, pp. 478-484.
[24] R. W. Ashby, “The Nervous System as Physical Machine: With Special Reference to the Origin of Adaptive Behavior,” Mind, Vol. 56, No. 221, 1947, pp. 44-59.
[25] J. L. Casti, “Canonical Models and the Law of Requisite Variety,” Journal of Optimization Theory and Applications, Vol. 46, No. 4, 1985, pp. 455-459.
[26] MMI.ORG, “Marine Metadata Interoperability Ontology Registry and Repository,” 2012.
[27] OOR, “Open Ontology Repository,” 2012.
[28] OASIS, “OASIS Advances CAP and Emergency Data Exchange Language (EDXL) Specifications,” 2006.
[29] XML.GOV, “Registires,” 2007.
[30] Protege, “Ontologies Registry,” 2007.
[31] E. Rohn, “Generational Analysis of Variety in Data Structures: Impact on Automatic Data Integration and on the Semantic Web,” Journal of Knowledge and Information Systems, Vol. 24, No. 2, 2009, pp. 283-304.
[32] E. Rohn, “Generational Analysis of Tension and Entropy in Data Structures: Impact on Automatic Data Integration and on the Semantic Web,” Knowledge and Information Systems, Vol. 28, No. 1, 2010, pp. 175-196.
[33] J. F. Sowa, “Worlds, Models and Descriptions,” Studia Logica, Vol. 84, No. 2, 2006, pp. 323-360.
[34] E. D. Sontag, “Mathematical Control Theory, Deterministic Finite Dimensional Systems,” 2nd Edition, Springer-Verlag, Heidelberg, 1998.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.