Journal of Software Engineering and Applications

Volume 18, Issue 1 (January 2025)

ISSN Print: 1945-3116   ISSN Online: 1945-3124

Google-based Impact Factor: 2  Citations  

Intelligent ETL for Enterprise Software Applications Using Unstructured Data

  XML Download Download as PDF (Size: 1769KB)  PP. 44-65  
DOI: 10.4236/jsea.2025.181003    110 Downloads   666 Views  

ABSTRACT

Enterprise applications utilize relational databases and structured business processes, requiring slow and expensive conversion of inputs and outputs, from business documents such as invoices, purchase orders, and receipts, into known templates and schemas before processing. We propose a new LLM Agent-based intelligent data extraction, transformation, and load (IntelligentETL) pipeline that not only ingests PDFs and detects inputs within it but also addresses the extraction of structured and unstructured data by developing tools that most efficiently and securely deal with respective data types. We study the efficiency of our proposed pipeline and compare it with enterprise solutions that also utilize LLMs. We establish the supremacy in timely and accurate data extraction and transformation capabilities of our approach for analyzing the data from varied sources based on nested and/or interlinked input constraints.

Share and Cite:

Joshi, M. and Madisetti, V. (2025) Intelligent ETL for Enterprise Software Applications Using Unstructured Data. Journal of Software Engineering and Applications, 18, 44-65. doi: 10.4236/jsea.2025.181003.

Cited by

No relevant information.

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.