Autonomous Data Exchange: The Malady and a Possible Path to Its Cure

Abstract

Data exchange is a goal-oriented social communications system implemented through computerized technology. Data definition languages (DDLs) provide the syntax for communicating within and between organizations, illocutionary acts, such as informing, ordering and warning. Data exchange results in meaning-preserving mapping between an ensemble (a constrained variety) and its external (unconstrained) variety. Research on unsupervised structured and semi-structured data exchange has not produced any significant successes over the past fifty years. As a step towards finding a solution, this article proposes a new look at data exchange by using the principles of complex adaptive systems (CAS) to analyze current shortcomings and to propose a direction that may indeed lead to workable and mathematically grounded solution. Three CAS attributes key to this research are variety, tension and entropy. We use them to show that older and contemporary DDLs are identical in their core, thus explaining why even XML and Ontologies have failed to a create fully automated data exchange mechanism. Then we show that it is possible to construct a radically different DDL that overcomes existing data exchange limitations—its variety, tension and entropy are different from existing solutions. The article has these major parts: definition of key CAS attributes; quantitative examination of representative old and new DDLs using these attributes; presentation of the results and their pessimistic ramification; a section that proposes a new theoretical way to construct DDLs that is based entirely on CAS principles, thus enabling unsupervised data exchange. The theory is then tested, showing very promising results.

Share and Cite:

Rohn, E. (2015) Autonomous Data Exchange: The Malady and a Possible Path to Its Cure. Intelligent Information Management, 7, 22-32. doi: 10.4236/iim.2015.71003.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Searle, J.R. (1969) Speech Acts: An Essay in the Philosophy of Language. Cambridge University Press, Cambridge. http://dx.doi.org/10.1017/CBO9781139173438
[2] Batini, C., Lenzerini, M. and Navathe, S. (1986) A Comparative Analysis of Methodologies for Database Schema Integration. ACM Computing Surveys, 18, 323-364.
http://dx.doi.org/10.1145/27633.27634
[3] Hunter, A. and Liu, W. (2006) Merging Uncertain Information with Semantic Heterogeneity in XML. Knowledge and Information Systems, 9, 230-258. http://dx.doi.org/10.1007/s10115-005-0220-y
[4] Flahive, A., Taniar, D., Rahayu, W. and Apduhan, B.O. (2012) A Methodology for Ontology Update in the Semantic grid Environment. Concurrency and Computation: Practice and Experience.
[5] Mao, M., Peng, Y. and Spring, M. (2011) Ontology Mapping: As a Binary Classification Problem. Concurrency and Computation: Practice and Experience, 23, 1010-1025.
http://dx.doi.org/10.1002/cpe.1633
[6] Sowa, J.F. (2001) Meaning-Preserving Translations.
http://users.bestweb.net/~sowa/logic/meaning.htm
[7] Sowa, J.F. (2006) Worlds, Models and Descriptions. Studia Logica, 84, 323-360.
http://dx.doi.org/10.1007/s11225-006-9012-y
[8] Ashby, R.W. (1940) Adaptiveness and Equilibrium. Journal of Mental Science, 86, 478-484.
[9] Ashby, R.W. (1956) An Introduction to Cybernetics. Chapman & Hall, London.
[10] Ashby, R.W. (1947) The Nervous System as Physical Machine: With Special Reference to the Origin of Adaptive Behavior. Mind, 56, 44-59.
[11] Casti, J.L. Canonical Models and the Law of Requisite Variety. Journal of Optimization Theory and Applications, 46.
[12] Buckley, W. (1967) Sociology and Modern Systems Theory. Prentice-Hall, Inc., Englewood Clifs.
[13] Raymond, R.C. (1950) Communication, Entropy, and Life. American Scientist, 38, 273-278.
[14] Shannon, C.E. (1948) A Mathematical Theory of Communication. Bell System Technical Journal, 27, 379-423. http://dx.doi.org/10.1002/j.1538-7305.1948.tb01338.x
[15] Dustdar, S., Pichler, R., Savenkov, V. and Truong, H.-L. (2012) Quality-Aware Service-Oriented Data Integration: Requirements, State of the Art and Open Challenges. SIGMOD Record, 41, 11-19.
http://dx.doi.org/10.1145/2206869.2206873
[16] Halevy, A., Rajaraman, A. and Ordille, J. (2006) Data Integration: The Teenage Years. In: Dayal, U., Whang, K.-Y., Lomet, D., Alonso, G., Lohman, G., Kersten, M., Cha, S.K. and Kim, Y.-K., Eds., Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB’06), VLDB Endowment, 9-16.
[17] Vincent, M.W., Liu, J. and Mohania, M. (2012) The Implication Problem for “Closest Node” Functional Dependencies in Complete XML Documents. Journal of Computer and System Sciences, 78, 1045-1098. http://dx.doi.org/10.1016/j.jcss.2012.01.003
[18] Zipf, G.K. (1949) Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology. Addison-Wesley, Reading.
[19] Miller, G.A. (1995) WordNet: A Lexical Database for English. Communications of the ACM, 38, 39-41.
http://dx.doi.org/10.1145/219717.219748
[20] WordNet (2005) WordNet Website. Princeton University, Princeton.
[21] Thue, A. (1914) Probleme über Veränderungen von Zeichenreihen nach gegebener Regeln. Skriftei utgit av Videnskopsselskapet i Kristiania, I, atematisknaturvidenskabelig klasse, No. 10.
[22] Weisstein, E.W. (2010) Substitution System. http://mathworld.wolfram.com/SubstitutionSystem.html

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.