When we have a .csv (comma-separated values) file, turning it into an .xml file is relatively simple:
With a spreadsheet application (preferably LibreOffice, for which we pay no licence fee to anyone!) we can open our .csv, paying attention to the character set applied (usually UTF-8) and identifying the characters that separate the columns (typically the comma or the tab) and that delimit text (quotation marks). This lets us check the data in the file we want to transform. Once it is saved, we copy all the text into a plain-text editor and paste it into one of the free online services that convert .csv to .xml, such as http://creativyst.com/Prod/15/. Once we have the converted code, we can copy it, paste it into a plain-text editor and save it with the .xml extension.
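For those who would rather not paste data into an online service, the same step can be sketched locally. This is only a minimal sketch using the Python standard library, not the method of the service mentioned above: the function names `to_element_name` and `csv_to_xml` are hypothetical, and the rule for turning headings into element names (replacing unsupported characters with an underscore) is an assumption.

```python
import csv
import io
import re
import xml.etree.ElementTree as ET


def to_element_name(heading: str) -> str:
    """Turn a column heading into a usable XML element name (assumed rule)."""
    # Replace every run of characters outside [A-Za-z0-9_.-] with one underscore.
    name = re.sub(r"[^A-Za-z0-9_.-]+", "_", heading.strip()).strip("_")
    # An element name must start with a letter or an underscore.
    if not name or not (name[0].isalpha() or name[0] == "_"):
        name = "_" + name
    return name


def csv_to_xml(csv_text: str) -> str:
    """Convert CSV text (first line = headings) into <document><row>... XML."""
    reader = csv.reader(io.StringIO(csv_text))
    headings = [to_element_name(h) for h in next(reader)]
    document = ET.Element("document")
    for record in reader:
        row = ET.SubElement(document, "row")
        for name, value in zip(headings, record):
            # Trim the stray spaces that often surround CSV fields.
            ET.SubElement(row, name).text = value.strip()
    return ET.tostring(document, encoding="unicode")


sample = (
    "IDGREC,Journal Title,Months embargo\n"
    "33,Acta Botánica Barcinonensia,0\n"
)
print(csv_to_xml(sample))
```

With this rule a heading such as `Journal Title` becomes `Journal_Title`; headings can of course also be renamed by hand in the spreadsheet before the conversion.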
The next transformation uses the libxslt library and its xsltproc command-line tool (http://xmlsoft.org/XSLT/xsltproc2.html), which apply XSLT stylesheets (such as http://www.gac-grid.org/project-products/Software/XML2RDF.html). After installing the library and applying the xml2rdf3.xsl stylesheet, we obtain our file in RDF format.
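As a rough sketch of this step, xsltproc can be driven from a small script. The file names `journals.xml` and `journals.rdf` and the two function names are hypothetical; the command itself follows the usual xsltproc form (`-o` for the output file, then the stylesheet, then the source document).

```python
import shutil
import subprocess


def build_xsltproc_command(stylesheet: str, source: str, output: str) -> list[str]:
    """Build the argument list: xsltproc -o OUTPUT STYLESHEET SOURCE."""
    return ["xsltproc", "-o", output, stylesheet, source]


def transform(stylesheet: str, source: str, output: str) -> None:
    """Run xsltproc; raises if the tool is missing or the transformation fails."""
    if shutil.which("xsltproc") is None:
        raise FileNotFoundError("xsltproc not found on PATH")
    subprocess.run(build_xsltproc_command(stylesheet, source, output), check=True)


# Show the command that transform() would run for the hypothetical file names.
print(" ".join(build_xsltproc_command("xml2rdf3.xsl", "journals.xml", "journals.rdf")))
```

Calling `transform("xml2rdf3.xsl", "journals.xml", "journals.rdf")` would then write the RDF file, provided xsltproc is installed and both input files exist.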
The semantic quality of the result depends heavily on the structure of the initial .csv file. During the transformations it is possible to update the semantic references with vocabularies (namespaces) that give the data a meaning intelligible to machines (that is, through mechanisms for matching against, and referring to, HTTP addresses), in keeping with RDF triples.
Example of the transformations:
Original content of the .csv file, first rows of data:
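To make the idea of vocabularies more concrete, here is a minimal sketch that maps a few of the element names used in the example XML of this article to Dublin Core properties and prints N-Triples. Everything specific in it is an assumption made for illustration: the choice of Dublin Core for these fields (suggested only by the original headings dc.identifier.ISSN and dc.publisher), the `http://example.org/journal/` base address, and the function names.

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"

# Hypothetical mapping from element names to Dublin Core properties.
VOCABULARY = {
    "Journal_Title": DC + "title",
    "ISSN": DC + "identifier",
    "Editor": DC + "publisher",
}

# Hypothetical base address used to mint one subject URI per row.
BASE = "http://example.org/journal/"


def escape_literal(text: str) -> str:
    """Escape the characters that N-Triples does not allow raw in a literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def rows_to_ntriples(xml_text: str, base: str = BASE) -> list[str]:
    """Emit one triple per mapped, non-empty field of every <row>."""
    lines = []
    for row in ET.fromstring(xml_text).iter("row"):
        subject = base + (row.findtext("IDGREC") or "").strip()
        for name, prop in VOCABULARY.items():
            value = (row.findtext(name) or "").strip()
            if value:  # empty fields produce no triple
                lines.append(f'<{subject}> <{prop}> "{escape_literal(value)}" .')
    return lines


xml_sample = (
    "<document><row><IDGREC>33</IDGREC>"
    "<Journal_Title>Acta Botánica Barcinonensia</Journal_Title>"
    "<ISSN>0210-7597</ISSN><Other></Other></row></document>"
)
for line in rows_to_ntriples(xml_sample):
    print(line)
```

Each printed line is a subject, a property and a value: the property is an HTTP address that a machine can look up, which is what gives the column a shared meaning.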
IDGREC,Journal Title,dc.identifier.ISSN,dc.publisher,Policy/ies,Copyrights of the,Other,Copyrights type,Publication type,Months embargo,Embargo start,Embargo stop
33,Acta Botánica Barcinonensia,0210-7597, Departament de Biologia Vegetal (Unitat de Botànica). Universitat de Barcelona ,1,Autor,,TDR – Tots els drets reservats,c – Publicat,0,01/01/1901,
44,Acta Geologica Hispanica,0567-7505, Universitat de Barcelona (UB). Institut de Ciències de la Terra Jaume Almera (ICTJA) ,1,Autor,,TDR – Tots els drets reservats,c – Publicat,0,01/01/1901,
48,Acta Historica et Archaeologica Mediaevalia,0212-2960, Universitat de Barcelona ,1,Autor,,TDR – Tots els drets reservats,c – Publicat,,,
Note the first line, made up of the headings that correspond to the column names.
Three groups of data follow, each representing one item.
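The relation between the heading line and the items can be checked with a few lines of code. This is a small sketch on a shortened, hypothetical extract of the data above (fewer columns, two rows), using the standard `csv.DictReader`, which takes its keys from the first line.

```python
import csv
import io

# Shortened extract of the data above: four of the columns, two of the rows.
sample = """IDGREC,Journal Title,dc.identifier.ISSN,Months embargo
33,Acta Botánica Barcinonensia,0210-7597,0
48,Acta Historica et Archaeologica Mediaevalia,0212-2960,
"""

# Each row becomes one item whose keys are the headings of the first line.
items = list(csv.DictReader(io.StringIO(sample)))
for item in items:
    print(item["IDGREC"], item["Journal Title"], repr(item["Months embargo"]))
```

An empty field, such as the embargo of the last journal, is read as an empty string rather than dropped, which is why it still shows up as an empty element after the transformations.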
Content after the first transformation, to .xml (some headings appear here under simplified element names, for example dc.publisher as Editor):
<document>
<row>
<IDGREC>33</IDGREC >
<Journal_Title>Acta Botánica Barcinonensia</Journal_Title >
<ISSN>0210-7597</ISSN >
<Editor>Departament de Biologia Vegetal (Unitat de Botànica). Universitat de Barcelona</Editor >
<Policy>1</Policy >
<Copyrights_of_the>Autor</Copyrights_of_the >
<Other></Other >
<Copyrights_type>TDR – Tots els drets reservats</Copyrights_type >
<Publication_type>c – Publicat</Publication_type >
<Months_embargo>0</Months_embargo >
<Embargo_start>01/01/1901</Embargo_start >
<Embargo_stop></Embargo_stop >
</row>
<row>
<IDGREC>44</IDGREC >
<Journal_Title>Acta Geologica Hispanica</Journal_Title >
<ISSN>0567-7505</ISSN >
<Editor>Universitat de Barcelona (UB). Institut de Ciències de la Terra Jaume Almera (ICTJA)</Editor >
<Policy>1</Policy >
<Copyrights_of_the>Autor</Copyrights_of_the >
<Other></Other >
<Copyrights_type>TDR – Tots els drets reservats</Copyrights_type >
<Publication_type>c – Publicat</Publication_type >
<Months_embargo>0</Months_embargo >
<Embargo_start>01/01/1901</Embargo_start >
<Embargo_stop></Embargo_stop >
</row>
<row>
<IDGREC>48</IDGREC >
<Journal_Title>Acta Historica et Archaeologica Mediaevalia</Journal_Title >
<ISSN>0212-2960</ISSN >
<Editor>Universitat de Barcelona</Editor >
<Policy>1</Policy >
<Copyrights_of_the>Autor</Copyrights_of_the >
<Other></Other >
<Copyrights_type>TDR – Tots els drets reservats</Copyrights_type >
<Publication_type>c – Publicat</Publication_type >
<Months_embargo></Months_embargo >
<Embargo_start></Embargo_start >
<Embargo_stop></Embargo_stop >
</row>
</document>
Final transformation, to .rdf:
<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<rdf:Description rdf:about="">
<document xmlns="#">
<rdf:Description rdf:about="#document">
<row>
<rdf:Description rdf:about="#document/row">
<IDGREC>
<rdf:Description rdf:about="#document/row/IDGREC">
<rdf:value>33</rdf:value>
</rdf:Description>
</IDGREC>
<rdf:_1>
<rdf:Description rdf:about="#document/row/IDGREC"/>
</rdf:_1>
<Journal_Title>
<rdf:Description rdf:about="#document/row/Journal_Title">
<rdf:value>Acta Botánica Barcinonensia</rdf:value>
</rdf:Description>
</Journal_Title>
<rdf:_2>
<rdf:Description rdf:about="#document/row/Journal_Title"/>
</rdf:_2>
<ISSN>
<rdf:Description rdf:about="#document/row/ISSN">
<rdf:value>0210-7597</rdf:value>
</rdf:Description>
</ISSN>
<rdf:_3>
<rdf:Description rdf:about="#document/row/ISSN"/>
</rdf:_3>
<Editor>
<rdf:Description rdf:about="#document/row/Editor">
<rdf:value>Departament de Biologia Vegetal (Unitat de Botànica). Universitat de Barcelona</rdf:value>
</rdf:Description>
</Editor>
<rdf:_4>
<rdf:Description rdf:about="#document/row/Editor"/>
</rdf:_4>
<Policy>
<rdf:Description rdf:about="#document/row/Policy">
<rdf:value>1</rdf:value>
</rdf:Description>
</Policy>
<rdf:_5>
<rdf:Description rdf:about="#document/row/Policy"/>
</rdf:_5>
<Copyrights_of_the>
<rdf:Description rdf:about="#document/row/Copyrights_of_the">
<rdf:value>Autor</rdf:value>
</rdf:Description>
</Copyrights_of_the>
<rdf:_6>
<rdf:Description rdf:about="#document/row/Copyrights_of_the"/>
</rdf:_6>
<Other>
<rdf:Description rdf:about="#document/row/Other"/>
</Other>
<rdf:_7>
<rdf:Description rdf:about="#document/row/Other"/>
</rdf:_7>
<Copyrights_type>
<rdf:Description rdf:about="#document/row/Copyrights_type">
<rdf:value>TDR – Tots els drets reservats</rdf:value>
</rdf:Description>
</Copyrights_type>
<rdf:_8>
<rdf:Description rdf:about="#document/row/Copyrights_type"/>
</rdf:_8>
<Publication_type>
<rdf:Description rdf:about="#document/row/Publication_type">
<rdf:value>c – Publicat</rdf:value>
</rdf:Description>
</Publication_type>
<rdf:_9>
<rdf:Description rdf:about="#document/row/Publication_type"/>
</rdf:_9>
<Months_embargo>
<rdf:Description rdf:about="#document/row/Months_embargo">
<rdf:value>0</rdf:value>
</rdf:Description>
</Months_embargo>
<rdf:_10>
<rdf:Description rdf:about="#document/row/Months_embargo"/>
</rdf:_10>
<Embargo_start>
<rdf:Description rdf:about="#document/row/Embargo_start">
<rdf:value>01/01/1901</rdf:value>
</rdf:Description>
</Embargo_start>
<rdf:_11>
<rdf:Description rdf:about="#document/row/Embargo_start"/>
</rdf:_11>
<Embargo_stop>
<rdf:Description rdf:about="#document/row/Embargo_stop"/>
</Embargo_stop>
<rdf:_12>
<rdf:Description rdf:about="#document/row/Embargo_stop"/>
</rdf:_12>
</rdf:Description>
</row>
<rdf:_1>
<rdf:Description rdf:about="#document/row"/>
</rdf:_1>
<row>
<rdf:Description rdf:about="#document/row_2">
<IDGREC>
<rdf:Description rdf:about="#document/row_2/IDGREC">
<rdf:value>44</rdf:value>
</rdf:Description>
</IDGREC>
<rdf:_1>
<rdf:Description rdf:about="#document/row_2/IDGREC"/>
</rdf:_1>
<Journal_Title>
<rdf:Description rdf:about="#document/row_2/Journal_Title">
<rdf:value>Acta Geologica Hispanica</rdf:value>
</rdf:Description>
</Journal_Title>
<rdf:_2>
<rdf:Description rdf:about="#document/row_2/Journal_Title"/>
</rdf:_2>
<ISSN>
<rdf:Description rdf:about="#document/row_2/ISSN">
<rdf:value>0567-7505</rdf:value>
</rdf:Description>
</ISSN>
<rdf:_3>
<rdf:Description rdf:about="#document/row_2/ISSN"/>
</rdf:_3>
<Editor>
<rdf:Description rdf:about="#document/row_2/Editor">
<rdf:value>Universitat de Barcelona (UB). Institut de Ciències de la Terra Jaume Almera (ICTJA)</rdf:value>
</rdf:Description>
</Editor>
<rdf:_4>
<rdf:Description rdf:about="#document/row_2/Editor"/>
</rdf:_4>
<Policy>
<rdf:Description rdf:about="#document/row_2/Policy">
<rdf:value>1</rdf:value>
</rdf:Description>
</Policy>
<rdf:_5>
<rdf:Description rdf:about="#document/row_2/Policy"/>
</rdf:_5>
<Copyrights_of_the>
<rdf:Description rdf:about="#document/row_2/Copyrights_of_the">
<rdf:value>Autor</rdf:value>
</rdf:Description>
</Copyrights_of_the>
<rdf:_6>
<rdf:Description rdf:about="#document/row_2/Copyrights_of_the"/>
</rdf:_6>
<Other>
<rdf:Description rdf:about="#document/row_2/Other"/>
</Other>
<rdf:_7>
<rdf:Description rdf:about="#document/row_2/Other"/>
</rdf:_7>
<Copyrights_type>
<rdf:Description rdf:about="#document/row_2/Copyrights_type">
<rdf:value>TDR – Tots els drets reservats</rdf:value>
</rdf:Description>
</Copyrights_type>
<rdf:_8>
<rdf:Description rdf:about="#document/row_2/Copyrights_type"/>
</rdf:_8>
<Publication_type>
<rdf:Description rdf:about="#document/row_2/Publication_type">
<rdf:value>c – Publicat</rdf:value>
</rdf:Description>
</Publication_type>
<rdf:_9>
<rdf:Description rdf:about="#document/row_2/Publication_type"/>
</rdf:_9>
<Months_embargo>
<rdf:Description rdf:about="#document/row_2/Months_embargo">
<rdf:value>0</rdf:value>
</rdf:Description>
</Months_embargo>
<rdf:_10>
<rdf:Description rdf:about="#document/row_2/Months_embargo"/>
</rdf:_10>
<Embargo_start>
<rdf:Description rdf:about="#document/row_2/Embargo_start">
<rdf:value>01/01/1901</rdf:value>
</rdf:Description>
</Embargo_start>
<rdf:_11>
<rdf:Description rdf:about="#document/row_2/Embargo_start"/>
</rdf:_11>
<Embargo_stop>
<rdf:Description rdf:about="#document/row_2/Embargo_stop"/>
</Embargo_stop>
<rdf:_12>
<rdf:Description rdf:about="#document/row_2/Embargo_stop"/>
</rdf:_12>
</rdf:Description>
</row>
<rdf:_2>
<rdf:Description rdf:about="#document/row_2"/>
</rdf:_2>
<row>
<rdf:Description rdf:about="#document/row_3">
<IDGREC>
<rdf:Description rdf:about="#document/row_3/IDGREC">
<rdf:value>48</rdf:value>
</rdf:Description>
</IDGREC>
<rdf:_1>
<rdf:Description rdf:about="#document/row_3/IDGREC"/>
</rdf:_1>
<Journal_Title>
<rdf:Description rdf:about="#document/row_3/Journal_Title">
<rdf:value>Acta Historica et Archaeologica Mediaevalia</rdf:value>
</rdf:Description>
</Journal_Title>
<rdf:_2>
<rdf:Description rdf:about="#document/row_3/Journal_Title"/>
</rdf:_2>
<ISSN>
<rdf:Description rdf:about="#document/row_3/ISSN">
<rdf:value>0212-2960</rdf:value>
</rdf:Description>
</ISSN>
<rdf:_3>
<rdf:Description rdf:about="#document/row_3/ISSN"/>
</rdf:_3>
<Editor>
<rdf:Description rdf:about="#document/row_3/Editor">
<rdf:value>Universitat de Barcelona</rdf:value>
</rdf:Description>
</Editor>
<rdf:_4>
<rdf:Description rdf:about="#document/row_3/Editor"/>
</rdf:_4>
<Policy>
<rdf:Description rdf:about="#document/row_3/Policy">
<rdf:value>1</rdf:value>
</rdf:Description>
</Policy>
<rdf:_5>
<rdf:Description rdf:about="#document/row_3/Policy"/>
</rdf:_5>
<Copyrights_of_the>
<rdf:Description rdf:about="#document/row_3/Copyrights_of_the">
<rdf:value>Autor</rdf:value>
</rdf:Description>
</Copyrights_of_the>
<rdf:_6>
<rdf:Description rdf:about="#document/row_3/Copyrights_of_the"/>
</rdf:_6>
<Other>
<rdf:Description rdf:about="#document/row_3/Other"/>
</Other>
<rdf:_7>
<rdf:Description rdf:about="#document/row_3/Other"/>
</rdf:_7>
<Copyrights_type>
<rdf:Description rdf:about="#document/row_3/Copyrights_type">
<rdf:value>TDR – Tots els drets reservats</rdf:value>
</rdf:Description>
</Copyrights_type>
<rdf:_8>
<rdf:Description rdf:about="#document/row_3/Copyrights_type"/>
</rdf:_8>
<Publication_type>
<rdf:Description rdf:about="#document/row_3/Publication_type">
<rdf:value>c – Publicat</rdf:value>
</rdf:Description>
</Publication_type>
<rdf:_9>
<rdf:Description rdf:about="#document/row_3/Publication_type"/>
</rdf:_9>
<Months_embargo>
<rdf:Description rdf:about="#document/row_3/Months_embargo"/>
</Months_embargo>
<rdf:_10>
<rdf:Description rdf:about="#document/row_3/Months_embargo"/>
</rdf:_10>
<Embargo_start>
<rdf:Description rdf:about="#document/row_3/Embargo_start"/>
</Embargo_start>
<rdf:_11>
<rdf:Description rdf:about="#document/row_3/Embargo_start"/>
</rdf:_11>
<Embargo_stop>
<rdf:Description rdf:about="#document/row_3/Embargo_stop"/>
</Embargo_stop>
<rdf:_12>
<rdf:Description rdf:about="#document/row_3/Embargo_stop"/>
</rdf:_12>
</rdf:Description>
</row>
The same three items expressed in RDF, though still in need of semantic refinement…