Blog

Blog

Exploring the world of open data: updates, insights, and innovations to empower data-driven solutions.


December 30, 2024

Introducing DataHub.io’s New Global Data Solutions

Discover how DataHub.io’s latest releases—Global Geo Data and the Worldwide Postal Code Database—empower organizations to innovate faster and more efficiently.


December 24, 2024

Technical Deep Dive: Our Global Postal Code Dataset & Roadmap

Discover the power of accurate, comprehensive postal code data with our Global Postal Code Dataset & Roadmap. Learn how we provide high-resolution coverage across 30+ countries, expand to 100+ countries by 2025, and ensure data quality with advanced validation pipelines. Perfect for logistics, geospatial analytics, and market expansion, our dataset offers bulk downloads, API access, and GIS compatibility. Explore our innovative crowdfunding model and practical use cases to transform your data-driven strategies.


December 23, 2024

Discover the Commodities Collection: Metals, Energy, Agriculture & More

Explore the new Commodities Collection on DataHub.io, featuring datasets on precious metals, energy resources, agricultural products, and livestock. Perfect for traders, researchers, and enthusiasts seeking market insights and trends.


December 13, 2024

Optimizing the Clinical Trials (US) Repository: Data Storage and Git LFS Solutions

The clinical-trials-us repository provides a critical resource, offering official U.S. clinical trial outcomes from the FDA. This data is vital for researchers, medical professionals, and policymakers. However, as the repository continues to grow, a key issue has surfaced regarding the best way to manage large datasets—specifically, the 2.3 GB of XML files sourced from ClinicalTrials.gov.


December 12, 2024

Celebrating 100 Stars on GitHub: R2 Bucket Uploader - Simplifying Cloudflare R2 File Uploads

We're excited to announce that our open-source library, R2 Bucket Uploader, has just reached 100 stars on GitHub! 🎉 This milestone is a testament to the value it provides to the community, and we wanted to take a moment to highlight its key features and how it simplifies integrating with Cloudflare R2 storage.


December 9, 2024

Kicking Off: Enhancing Football Datasets on Datahub.io

Discover the latest updates to Datahub.io's football datasets repository, including improved data processing workflows, expanded datasets, and automated updates.


November 7, 2024

Empowering Logistics with a Global Postal Code Data Solution

In the logistics industry, having accurate, up-to-date postal code data is crucial for smooth operations, especially when navigating complex international shipping requirements. In our recent project with a Fortune 500 logistics enterprise, we delivered a comprehensive postal code dataset solution designed to meet this need on a global scale.


October 21, 2024

Country List Dataset: Latest Update, Easy Access on DataHub.io, and Upcoming NPM Release

The Country List dataset is one of the most essential core datasets we maintain at DataHub. It provides a simple, up-to-date list of countries with their official English names and 2 digit codes (ISO 3166-1) in a developer-friendly CSV format.


October 1, 2024

Updating the Country Codes Open Dataset: A Major Overhaul

We are excited to share the latest updates on our open dataset, country-codes. Over time, this dataset had become outdated, and the underlying codebase required a significant overhaul. To ensure it remains reliable and up-to-date, we embarked on a comprehensive update, restructuring the codebase and implementing new processes for regular maintenance.


September 27, 2024

What is next: Enhancing Dataset Discovery and Providing Core Data for the World

In the world of data, accessibility and quality are crucial. As we move into the next phase of DataHub.io, our goal is to make it the go-to place for finding essential and popular datasets. Alongside that, we're building a seamless experience for data publishers to upload and showcase their datasets. Over time, we envision this evolving into a vibrant data marketplace.


2024-09-25

Introducing a new endpoint for fetching raw data files from DataHub Cloud datasets


2024-07-10

Learn how to publish your Obsidian vault with DataHub Cloud


2024-05-20

Learn how to configure basic SEO fields and navigation bar in your DataHub Cloud sites


2024-05-17

Learn how to publish a dataset with DataHub Cloud


2024-05-03

Learn how to style your DataHub Cloud sites with custom CSS


2024-03-05

DataHub Cloud Launch on Open Data Day: Build elegant data-driven sites with markdown & deploy in seconds


2023-12-12

Unveiling MarkdownDB's Latest Features: Export to JSON, task extraction, and computed fields 🚀


2023-10-11

Announcing MarkdownDB: an open source tool to create an SQL API to your markdown files! 🚀

MarkdownDB - an open source library to transform markdown content into sql-queryable data. Build rich markdown-powered sites easily and reliably. New dedicated website at markdowndb.com


2023-05-30

Create a catalog of anything using Markdown files in Obsidian


2023-05-29

Quarto: A tool to publish Jupyter notebooks as static websites


2023-04-18

Exporting Wikidata with SPARQL and ChatGPT


2023-04-01

Tutorial: Publishing data rich documents on DataHub


2023-02-13

We have some important updates re Datahub.io!


2022-03-14

Generate an interactive webpage from CSV data and markdown


2021-06-22

A Short Case Study Involving Table Schema Frictionless Specs at the European Union


2021-02-19

A Vision for the next generation of the DataHub (v3)

An overview of the next generation of the DataHub. We want to make it incredibly easy, fast and reliable to share your data in a useable way.\n


2020-05-08

COVID-19 and Compartmental Models in Epidemiology


2020-03-17

Open Data Day 2020 and COVID-19 data


2020-03-08

Comparotron: A simple way to visualize and share comparisons


2018-09-10

New Machine Learning Datasets


2018-09-05

Automatically updated core datasets on DataHub


2018-08-31

Sports data on DataHub


2018-08-23

Attribute Relation File Format (ARFF)


2018-07-18

How to use multiple DataHub accounts


2018-07-16

World Bank Indicators on DataHub


2018-07-10

Automated KPIs collection and visualization of the funnels


2018-06-11

Revamped awesome collections: data sets that are grouped by subject


2018-05-25

Machine learning datasets


2018-05-23

Auto-publish your datasets using Travis-CI


2018-05-15

JavaScript SDK for data deployment


2018-05-14

How to initialize a data package using data tool


2018-04-19

Validate your Data Package descriptor online


2018-04-11

Q1 2018 Review


2018-03-26

New Features and Improvements


2018-01-29

Improved Reporting and Debugging of Data Publishing


2018-01-24

Data Validation in the DataHub


2018-01-23

Which country spends the most on pharmaceutical drugs?


2017-12-13

Introducing private datasets on the DataHub


2017-12-01

Data desktop app - alpha release with drag and drop data publishing support


2017-11-16

How to use Data Packages from R


2017-11-14

Import online data files directly with scheduling


2017-11-03

Core Data: Essential Datasets for Data Wranglers and Data Scientists


2017-10-31

See events and activity related to datasets or publishers


2017-10-19

Datasets in zip format


2017-10-18

Previews for large datasets


2017-10-17

Vega views upgrade - now using v3


2017-10-16

Excel Files on the DataHub: Automated Previews and Data Extraction


2017-10-11

Data Package v1 Specifications. What has Changed and how to Upgrade


© 2024 All rights reservedBuilt with Find, Share and Publish Quality Data with Datahub

Built with Find, Share and Publish Quality Data with DatahubFind, Share and Publish Quality Data with Datahub