arXiv – CS AI
Conceptual Schema Inference for Tabular Datasets using Large Language Models
Researchers propose LLM-based approaches (GeSI and EmSI) to automatically infer conceptual schemas from heterogeneous tabular datasets by analyzing column headers and cell values. The methods address the challenge of organizing large, inconsistent data collections from diverse sources by deriving entity types, attributes, and relationships without manual intervention.
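To make the input side of this idea concrete, here is a minimal sketch of how column headers and a few sample cell values might be gathered from several tables and packed into a single prompt for an LLM. It is not the GeSI or EmSI procedure, which this summary does not describe. The function names (`summarize_table`, `build_schema_prompt`), the prompt wording, and the requested JSON output shape are all illustrative assumptions, and no model is actually called.

```python
import csv
import io
import json


def summarize_table(name, csv_text, max_samples=3):
    """Collect each column header with a few distinct, non-empty sample values.

    Hypothetical helper for illustration; not taken from the paper.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    samples = {header: [] for header in (reader.fieldnames or [])}
    for row in reader:
        for header, value in row.items():
            if header is None:
                # Extra cells beyond the header row; ignore them.
                continue
            value = (value or "").strip()
            bucket = samples[header]
            if value and value not in bucket and len(bucket) < max_samples:
                bucket.append(value)
    return {"table": name, "columns": samples}


def build_schema_prompt(table_summaries):
    """Assemble a prompt asking for entity types, attributes, and relationships.

    The instruction text and output format are assumptions, not the authors' prompt.
    """
    instructions = (
        "Given the tables below (column headers with sample cell values), "
        "infer a conceptual schema. Return JSON with the keys "
        "'entity_types', 'attributes', and 'relationships'."
    )
    return instructions + "\n\n" + json.dumps(table_summaries, indent=2)


if __name__ == "__main__":
    orders = "order_id,customer,city\n1,Ana,Lisbon\n2,Ben,Lisbon\n"
    customers = "name,email\nAna,ana@example.com\nBen,ben@example.com\n"
    summaries = [
        summarize_table("orders", orders),
        summarize_table("customers", customers),
    ]
    # The resulting string would be sent to whichever LLM is in use.
    print(build_schema_prompt(summaries))
```

Sending only headers and a handful of sample values keeps the prompt small even for large tables, at the cost of possibly missing rare values that would disambiguate a column.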