metadata

New Data Curators Wanted

Our curators help us vocalize the needs of their domain, be it detecting gender biases of recommending engines, audience data on cultural heritage, or detecting evidence of greenwashing, and evaluates if the data that we come up with is directly usable and actionable.

Cultural & Creative Sectors and Industries Observatory, Daniel Antal

Nov 9, 2022 7 min read

New Data Curators Wanted

stacodelists: use standard, language-independent variable codes to help international data interoperability and machine reuse in R

A new building block of our observatories went through code peer review and was released yesterday. The statcodelists R package aim to promote the reuse and exchange of statistical information and related metadata with making the internationally standardized SDMX code lists available for the R user.

Daniel Antal

Jun 29, 2022 3 min read

stacodelists: use standard, language-independent variable codes to help international data interoperability and machine reuse in R

Ensuring the Visibility and Accessibility of European Creative Content on the World Market: The Need for Copyright Data Improvement in the Light of New Technologies

The lacking strategy to organize data and metadata in a multilingual Europe puts creators opens up the biggest cultural market of the world to American competition.

Martin Senfleben, Thomas Margoni, Daniel Antal, Balazs Bodó, Stef van Gompel, Christian Handke, Martin Kretschmer, Joost Poort, João Quintais, Sebastian Felix Schwemer

Ensuring the Visibility and Accessibility of European Creative Content on the World Market: The Need for Copyright Data Improvement in the Light of New Technologies

How We Add Value to Public Data With Better Curation And Documentation?

Many people ask if we can really add value to free data that can be downloaded from the Internet by anybody. We do not only work with easy-to-download data, but we know that free, public data usually requires a lot of work to become really valuable. To start with, it is not always easy to find.

Daniel Antal

Last updated on Nov 10, 2021 5 min read

How We Add Value to Public Data With Better Curation And Documentation?

How We Add Value to Public Data With Imputation and Forecasting

Public data sources are often plagued with missng values. Naively you may think that you can ignore them, but think twice: in most cases, missing data in a table is not missing information, but rather malformatted information which will destroy your beautiful visualization or stop your application from working. In this example we show how we increase the usable subset of a public dataset by 66.7%, rendering useful what would otherwise have been a deal-breaker in panel regressions or machine learning applications.

Daniel Antal

Last updated on Nov 11, 2021 7 min read

How We Add Value to Public Data With Imputation and Forecasting

The Data Sisyphus

Sisyphus was punished by being forced to roll an immense boulder up a hill only for it to roll down every time it neared the top, repeating this action for eternity. When was a file downloaded from the internet? What happened with it sense? Are their updates? Did the bibliographical reference was made for quotations? Missing values imputed? Currency translated? Who knows about it – who created a dataset, who contributed to it? Which is the final, checked, approved by a senior manager?

Daniel Antal

Jul 8, 2021 7 min read

The Data Sisyphus

Metadata

Adding metadata exponentially increases the value of data. Did somebody already adjust old data to conform to constantly changing geographic boundaries? What are some practical ways of combining satellite sensory data with my organization’s records? And do I have the right to do so? Metadata logs the history of data, providing instructions on how to reuse it, also setting the terms of use. We automate this labor-intensive process applying the FAIR data concept.

Jul 7, 2021

Metadata

Metadata

Uncut diamonds need to be cut, polished, and you have to make sure that they come from a legal source.

Daniel Antal

Last updated on Jul 8, 2021

Metadata