Learning to Spot the Revealing Gaps in Our Public Data Sets

“As art installations go, it is low key: a filing cabinet filled with meticulously labelled hanging folders. Visitors are welcome to browse under any heading that sparks their interest: publicly available gun trace data; the Nanjing massacre death toll; English language rules internalised by native speakers; how much Spotify pays each artist per play of song. The folders are all empty.

The work, titled “The Library of Missing Datasets”, is by Mimi Onuoha, an artist and adjunct professor at New York University. The aim, she says, is to expose the “blank spots in spaces that are otherwise saturated with data”. The blanks can reveal hidden biases in a society….”

Center for Open Data Enterprise (CODE)

“Open government data is a powerful tool for economic growth, social benefit, and scientific research. This global resource must be developed and managed in ways that meet the needs of the people and organizations that use it.

CODE brings together data providers and data users to develop better strategies that serve stakeholders and their common goals. 

CODE, founded as the Center for Open Data Enterprise, is a 501(c)3 nonprofit organization whose mission is to maximize the value of open government data as a resource for economic growth, social good, and scientific research….”

India’ Open Government Data Platform Is Helping Data Scientists Kick-start Their ML Journey

The NDA government has come into its new term with a renewed gusto towards analytics in the public sector. Recognising the disruptive effect that the upcoming AI wave will have on citizen’s day-to-day activities, the government has put it on a spotlight.

One of the biggest needs for a healthy analytics ecosystem in any given environment is data. Identifying the data-hungry nature of the new data science and analytics startups in India, the government initiated the Open Government Data Platform at data.gov.in….

This move allows data scientists and machine learning engineers alike to harness one of the biggest collections of datasets available to the public….”

White House Releases Draft Federal Data Strategy Action Plan – SPARC

Yesterday, the White House Office of Management and Budget (OMB) released their long-awaited draft Federal Data Strategy Action Plan which outlines the Administration’s concrete action plan for implementing the President’s Management Agenda priority to leverage data as a National Strategic Asset. It also serves as a blueprint for the government’s implementation of the Foundations for Evidence-Based Policymaking/Open Government Act, which was signed into law in January.

Along with the draft action plan, OMB released final versions of the principles and practices it expects agencies to follow in gathering, using, protecting, and engaging with data.

The draft action plan, which is open for public comment until July 5th, lays out actions considered fundamental for the government to undertake during the first year in order to execute the full breadth of the strategy over time. It includes concrete deliverables for each individual federal agency, as well as government-wide actions facilitated by collaborative agency work.

The plan articulates six actions for all federal agencies to individually complete once the action plan is finalized in August:

  1. Improve data resources for artificial intelligence research and development by February 2020
  2. Constitute a diverse data governance body by September 2019
  3. Assess data and related infrastructure maturity by May 2020
  4. Identify opportunities to increase staff data skills by May 2020
  5. Identify data needed to answer key agency questions by August 2020
  6. Identify priority datasets for agency open data plans by August 2020…”

The associativity evaluation between open data and country characteristics | The Electronic Library | Vol 37, No 2

Abstract:  Purpose

The purpose of this study is to review the levels of open government data (OGD) among various countries that are not consistent with the development levels of those countries. This study evaluates the associativity between OGD Index (OGD) and the characteristics of those countries as well as to compare the degree of OGD among countries. Accordingly, an advanced discussion to explore how a country’s characteristics affect how that country’s government opens data was presented.

 

Design/methodology/approach

The stakeholder relationships of OGD is analysed with the characteristics of a country. The usage data are compared with the data availability according to nine indicators. These data collected from the statistics and OGDI websites are grouped for comparative statistical analyses based on basic descriptive statistics, one-way analysis of variance and a regression model with variance inflation faction.

 

Findings

The results 1) revealed the reasons some countries have high-ranking indexes and 2) verified the high index values of countries in terms of their degrees of development. This study, thus, attempted to derive a balanced appraisal of national development and OGD.

 

Research limitations/implications

The study sample is limited only to countries 1) which open the statistical data; and 2) are of uneven population density and development degree. The OGDI is limited to expert evaluation. The score might be vary to experts and users with diverse countries at different evaluation period. The limitations can be attributed to the differences between OGDI and real open levels. These differences might influence the reliability and validity.

Practical implications
 

Government departments with OGD policies provide raw data in various formats and with application interfaces for user access. This study, thus, attempts to derive a balanced appraisal of national development and OGD. The factors that evaluate which types of countries open the level of data are explored.

Originality/value
 

This study establishes stakeholder relationships of OGD and extends to analyse the characteristics of a country and OGD that affect the government data open level. The relationships are evaluated through the OGDI with design score scheme. The measurement results indicated that a country possesses high relation to open data with high DI and nature resource.

Exploring the quality of government open data | Comparison study of the UK, the USA and Korea | The Electronic Library | Vol 37, No 1

Abstract:  Purpose

The use of “open data” can help the public find value in various areas of interests. Many governments have created and published a huge amount of open data; however, people have a hard time using open data because of data quality issues. The UK, the USA and Korea have created and published open data; however, the rate of open data implementation and level of open data impact is very low because of data quality issues like incompatible data formats and incomplete data. This study aims to compare the statuses of data quality from open government sites in the UK, the USA and Korea and also present guidelines for publishing data format and enhancing data completeness.

Design/methodology/approach

This study uses statistical analysis of different data formats and examination of data completeness to explore key issues of data quality in open government data.

Findings

Findings show that the USA and the UK have published more than 50 per cent of open data in level one. Korea has published 52.8 per cent of data in level three. Level one data are not machine-readable; therefore, users have a hard time using them. The level one data are found in portable document format and hyper text markup language (HTML) and are locked up in documents; therefore, machines cannot extract out the data. Findings show that incomplete data are existing in all three governments’ open data.

Originality/value

Governments should investigate data incompleteness of all open data and correct incomplete data of the most used data. Governments can find the most used data easily by monitoring data sets that have been downloaded most frequently over a certain period.

Open Government Partnership and the Open Data for Development Network join forces to support open…

The partnership will help advance the open data efforts in more than 60 OGP countries that have committed to implement ambitious open data principles.

Since the creation of the Open Government Partnership, Open Data commitments have been at the core of open government initiatives aiming to empowering government and civil society reformers to improve public services, reduce corruption, and harness technology to make government more efficient. The OGP 16 Paris Declaration recognizes that the increased availability of data is transforming the way citizens and governments interact, and is creating new opportunities for participation, responsiveness, and ongoing dialogue….”

Opinion | The Senate Should Reject Trump’s NOAA Nominee – The New York Times

The safety and economic well-being of Americans will be put at risk if the Senate confirms Barry Lee Myers as the next administrator of the National Oceanic and Atmospheric Administration.

As a nonscientist, Mr. Myers lacks the professional credentials to lead a science-centric agency responsible for daily weather forecasts, severe storm warnings, climate monitoring, fisheries management, coastal restoration and support for marine commerce.

As the former chief executive of the private weather-forecasting company AccuWeather, which relies on data from NOAA’s National Weather Service, he spent years trying to privatize NOAA’s public weather information so his company could profit from it. His family continues to run the family-owned company, raising concerns that they could benefit from decisions he might make as NOAA’s administrator….”