Open Access
PPORTAL: Public Domain Portuguese-language Literature Dataset
Author(s) - Mariana O. Silva, Clarisse Scofield, Mirella M. Moro
Publication year - 2021
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/dsw.2021.17416
Subject(s) - metadata, portuguese, computer science, context (archaeology), domain (mathematical analysis), process (computing), publishing, data science, public domain, resource (disambiguation), world wide web, information retrieval, linguistics, geography, political science, archaeology, mathematical analysis, computer network, philosophy, mathematics, law, operating system
Combining human expertise with book-consumer data may provide what is needed to keep pace with the constant changes in the book publishing market. Building and releasing datasets that cover the essential elements of the book industry ecosystem is therefore important. However, little has been done in this context for non-English languages such as Portuguese. Hence, we introduce PPORTAL, a public domain Portuguese-language literature dataset composed of book-related metadata. After an overview of its building process and content, we present a brief exploratory data analysis that summarizes its main characteristics. We also highlight potential applications, showing how PPORTAL can serve as a resource in different research domains.
