Skip to content

Latest commit

 

History

History
9 lines (7 loc) · 627 Bytes

README.md

File metadata and controls

9 lines (7 loc) · 627 Bytes

abbreviation-extractor

The abbreviation-extractor is a tool designed to identify and extract abbreviations from PDF documents. This Rust implementation is inspired by the Schwartz-Hearst1 algorithm and is intended to be useful for researchers, scholars and people dealing with academic PDF content.

References

Footnotes

  1. Schwartz, Ariel & Hearst, Marti. (2003). A Simple Algorithm For Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 4. 451-62. 10.1142/9789812776303_0042.