Explained how the raw data from Xquery becomes the said.csv
file
Data quality was the theme today:
said
post, was able to go back to the Digital Edition in the private repo and fix all of the missing attributes.said.csv
file.Also started a conversation around using emojis to label main characters.
speaker
and addressee
attributes in chapters 1 thru 12
If you see mistakes or want to suggest changes, please create an issue on the source repository.
For attribution, please cite this work as
Glachant (2022, Oct. 6). Data Ignota: Data Cleaning and Tidying. Retrieved from https://syvwlch.github.io/Data-Ignota/posts/2022-10-06-data-cleaning-and-tidying/
BibTeX citation
@misc{glachant2022data, author = {Glachant, Mathieu}, title = {Data Ignota: Data Cleaning and Tidying}, url = {https://syvwlch.github.io/Data-Ignota/posts/2022-10-06-data-cleaning-and-tidying/}, year = {2022} }