patentCity


🗃️ This repository is the codebase of the patentCity database.

📚 The patentCity database reports data on patentees extracted from patent texts, together with enriched data, covering Germany (including East Germany), France, Great Britain and the USA since the 19th century.

💥 The goal of the database is to spur research on the history of innovation and to deepen the set of natural experiments from which historians, economists and scientists in general can learn to improve our understanding of innovation dynamics.

🌎 Explore and visualize the patentCity database online at patentcity.xyz (click on the map under the "Explore" section).

📥 Download the patentCity database here (CC BY 4.0).

📝 The patentCity codebase and database are documented at doc.patentcity.xyz.

🤗 We open-source our code (MIT) to support future extensions and to encourage a collaborative way of creating and continuously improving research databases.

💌 patentCity will expand and improve continuously in the coming years. To receive updates, join our newsletter and star the GitHub repository!

©️ patentCity is the backbone of Bergeaud and Verluise (2022). If you use the data or the codebase, make sure to cite the paper.

Developer guide

Installation and set-up

# Clone repo
git clone https://github.com/cverluise/patentcity.git

# Set up the environment: install dependencies and the CLI
cd patentcity/
poetry install

Install poetry if needed.

# Get data and models (dev only; access to the GCP bucket is required)
dvc pull

Install dvc if needed.
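
Before running the steps above, it can help to confirm that the required tools are on your PATH. The snippet below is a minimal sketch, not part of the patentcity codebase: `check_tool` is a hypothetical helper name introduced here for illustration, and it only relies on the POSIX `command -v` builtin.

```shell
# Hypothetical helper (not shipped with patentcity):
# report whether a given command is available on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
    return 1
  fi
}

# Check the three tools used in the set-up steps.
# "|| true" keeps the loop going when a tool is missing.
for tool in git poetry dvc; do
  check_tool "$tool" || true
done
```

Any tool reported as missing should be installed before continuing; dvc is only needed if you have access to the GCP bucket.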

API doc

The API doc is available at the documentation website under the API section.

Overall, the API is designed to be versatile and to adapt to new documents seamlessly. The main functions are exposed through the patentcity CLI.