#sample json structure
{
"glossary": {
"title": "example glossary",
"GlossDiv": {
"title": "S",
"GlossList": {
"GlossEntry": {
"ID": "SGML",
"SortAs": "SGML",
"GlossTerm": "Standard Generalized Markup Language",
"Acronym": "SGML",
"Abbrev": "ISO 8879:1986",
"GlossDef": {
"para": "A meta-markup language, used to create markup languages such as DocBook.",
"GlossSeeAlso": ["GML", "XML"]
},
"GlossSee": "markup"
}
}
}
}
}
Extracted Schema will be
['glossary'] #for level 1
['glossary.title', 'glossary.GlossDiv'] #for level 2
['glossary.title', 'GlossDiv.title', 'GlossDiv.GlossList'] #for level 3
#and same goes on
#sample XML structure
<note>
<to id='title'>Tove</to>
<from>Jani</from>
<heading class='red'>Reminder</heading>
<body>Don't forget me this weekend!</body>
</note>
Extracted Schema will be
note
note/to
note/to/@title #attributes will be displayed followed by @
note/to/from
note/to/from/heading
note/to/from/heading/body
#sample .dbt, .dbf file content saved as .txt file
Table cost
ID source datatime fieldname
#some random data
Table Bill
ID department quantity
#some random data
Extracted Schema will be
['cost.ID', 'cost.source', 'cost.datetime', 'cost.fieldname', 'Bill.ID', 'Bill.department', 'Bill.quantity']