Blog
Blog
Exploring the world of open data: updates, insights, and innovations to empower data-driven solutions.
December 30, 2024
Introducing DataHub.io’s New Global Data Solutions
Discover how DataHub.io’s latest releases—Global Geo Data and the Worldwide Postal Code Database—empower organizations to innovate faster and more efficiently.
December 24, 2024
Technical Deep Dive: Our Global Postal Code Dataset & Roadmap
Discover the power of accurate, comprehensive postal code data with our Global Postal Code Dataset & Roadmap. Learn how we provide high-resolution coverage across 30+ countries, expand to 100+ countries by 2025, and ensure data quality with advanced validation pipelines. Perfect for logistics, geospatial analytics, and market expansion, our dataset offers bulk downloads, API access, and GIS compatibility. Explore our innovative crowdfunding model and practical use cases to transform your data-driven strategies.
December 23, 2024
Discover the Commodities Collection: Metals, Energy, Agriculture & More
Explore the new Commodities Collection on DataHub.io, featuring datasets on precious metals, energy resources, agricultural products, and livestock. Perfect for traders, researchers, and enthusiasts seeking market insights and trends.
December 13, 2024
Optimizing the Clinical Trials (US) Repository: Data Storage and Git LFS Solutions
The clinical-trials-us repository provides a critical resource, offering official U.S. clinical trial outcomes from the FDA. This data is vital for researchers, medical professionals, and policymakers. However, as the repository continues to grow, a key issue has surfaced regarding the best way to manage large datasets—specifically, the 2.3 GB of XML files sourced from ClinicalTrials.gov.
December 12, 2024
Celebrating 100 Stars on GitHub: R2 Bucket Uploader - Simplifying Cloudflare R2 File Uploads
We're excited to announce that our open-source library, R2 Bucket Uploader, has just reached 100 stars on GitHub! 🎉 This milestone is a testament to the value it provides to the community, and we wanted to take a moment to highlight its key features and how it simplifies integrating with Cloudflare R2 storage.
December 9, 2024
Kicking Off: Enhancing Football Datasets on Datahub.io
Discover the latest updates to Datahub.io's football datasets repository, including improved data processing workflows, expanded datasets, and automated updates.
November 7, 2024
Empowering Logistics with a Global Postal Code Data Solution
In the logistics industry, having accurate, up-to-date postal code data is crucial for smooth operations, especially when navigating complex international shipping requirements. In our recent project with a Fortune 500 logistics enterprise, we delivered a comprehensive postal code dataset solution designed to meet this need on a global scale.
October 21, 2024
Country List Dataset: Latest Update, Easy Access on DataHub.io, and Upcoming NPM Release
The Country List dataset is one of the most essential core datasets we maintain at DataHub. It provides a simple, up-to-date list of countries with their official English names and 2 digit codes (ISO 3166-1) in a developer-friendly CSV format.
October 1, 2024
Updating the Country Codes Open Dataset: A Major Overhaul
We are excited to share the latest updates on our open dataset, country-codes
. Over time, this dataset had become outdated, and the underlying codebase required a significant overhaul. To ensure it remains reliable and up-to-date, we embarked on a comprehensive update, restructuring the codebase and implementing new processes for regular maintenance.
September 27, 2024
What is next: Enhancing Dataset Discovery and Providing Core Data for the World
In the world of data, accessibility and quality are crucial. As we move into the next phase of DataHub.io, our goal is to make it the go-to place for finding essential and popular datasets. Alongside that, we're building a seamless experience for data publishers to upload and showcase their datasets. Over time, we envision this evolving into a vibrant data marketplace.
2024-09-25
Introducing a new endpoint for fetching raw data files from DataHub Cloud datasets
2024-07-10
Learn how to publish your Obsidian vault with DataHub Cloud
2024-05-20
Learn how to configure basic SEO fields and navigation bar in your DataHub Cloud sites
2024-05-17
Learn how to publish a dataset with DataHub Cloud
2024-05-03
Learn how to style your DataHub Cloud sites with custom CSS
2024-03-05
2023-12-12
Unveiling MarkdownDB's Latest Features: Export to JSON, task extraction, and computed fields 🚀
2023-10-11
Announcing MarkdownDB: an open source tool to create an SQL API to your markdown files! 🚀
MarkdownDB - an open source library to transform markdown content into sql-queryable data. Build rich markdown-powered sites easily and reliably. New dedicated website at markdowndb.com
2023-05-30
Create a catalog of anything using Markdown files in Obsidian
2023-05-29
Quarto: A tool to publish Jupyter notebooks as static websites
2023-04-18
Exporting Wikidata with SPARQL and ChatGPT
2023-04-01
Tutorial: Publishing data rich documents on DataHub
2023-02-13
We have some important updates re Datahub.io!
2022-03-14
Generate an interactive webpage from CSV data and markdown
2021-06-22
A Short Case Study Involving Table Schema Frictionless Specs at the European Union
2021-02-19
A Vision for the next generation of the DataHub (v3)
An overview of the next generation of the DataHub. We want to make it incredibly easy, fast and reliable to share your data in a useable way.\n
2020-05-08
COVID-19 and Compartmental Models in Epidemiology
2020-03-17
Open Data Day 2020 and COVID-19 data
2020-03-08
Comparotron: A simple way to visualize and share comparisons
2018-09-10
2018-09-05
Automatically updated core datasets on DataHub
2018-08-31
2018-08-23
Attribute Relation File Format (ARFF)
2018-07-18
How to use multiple DataHub accounts
2018-07-16
World Bank Indicators on DataHub
2018-07-10
Automated KPIs collection and visualization of the funnels
2018-06-11
Revamped awesome collections: data sets that are grouped by subject
2018-05-25
2018-05-23
Auto-publish your datasets using Travis-CI
2018-05-15
JavaScript SDK for data deployment
2018-05-14
How to initialize a data package using data tool
2018-04-19
Validate your Data Package descriptor online
2018-04-11
2018-03-26
2018-01-29
Improved Reporting and Debugging of Data Publishing
2018-01-24
Data Validation in the DataHub
2018-01-23
Which country spends the most on pharmaceutical drugs?
2017-12-13
Introducing private datasets on the DataHub
2017-12-01
Data desktop app - alpha release with drag and drop data publishing support
2017-11-16
How to use Data Packages from R
2017-11-14
Import online data files directly with scheduling
2017-11-03
Core Data: Essential Datasets for Data Wranglers and Data Scientists
2017-10-31
See events and activity related to datasets or publishers
2017-10-19
2017-10-18
2017-10-17
Vega views upgrade - now using v3
2017-10-16
Excel Files on the DataHub: Automated Previews and Data Extraction
2017-10-11
Data Package v1 Specifications. What has Changed and how to Upgrade