@GutenbergSource

Project Gutenberg Sources

Source files, used to produce public domain ebooks posted to Project Gutenberg

About these Repositories

The repositories in GutenbergSource hold all the source files for ebooks I've submitted to Project Gutenberg. The source files are in TEI format; more precisely, they use the now-obsolete SGML syntax and follow the P3 version of TEI. (A version in XML is also included.)

Each of these repositories has the following contents (some of these are optional).

  • <name>-<version>.tei -- The source file itself.
  • metadata.xml -- automatically generated metadata in RDF format.
  • README.md -- automatically generated readme in markdown format.
  • good_words.txt -- File generated by PGDP site, containing words marked as 'good'.
  • bad_words.txt -- File generated by PGDP site, containing words marked as 'bad'.
  • project<hex-number>-comments.html -- Instructions as used at PGDP site.
  • tei2html.config -- Configuration for my tei2html tooling, used to create derived versions.
  • Processed -- a directory with processed results
    • <name>.html -- HTML file derived from the source file.
    • <name>.xml -- XML file in TEI format, derived from the source file.
    • <name>.txt -- Latin-1 plain text file, manually derived from the source file.
    • <name>-utf8.txt -- UTF-8 plain text file, manually derived from the source file.
    • images -- a directory with illustrations.
    • images@1 -- a directory with illustrations, at 144 DPI, maximum dimension 720px on longest edge.
    • images@2 -- a directory with illustrations, at 288 DPI, maximum dimension 1440px on longest edge.
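As a rough illustration of the layout listed above, the following Python sketch reports which of the documented items are present in a local checkout of one repository. The function name `inventory` and the glob patterns are assumptions made for this example (the placeholders `<name>`, `<version>`, and `<hex-number>` are simply replaced by wildcards); it is not part of the tei2html tooling.

```python
from pathlib import Path

# Patterns derived from the list above; placeholders become wildcards.
# These patterns are an assumption for illustration, not an official spec.
TOP_LEVEL = [
    "*-*.tei",
    "metadata.xml",
    "README.md",
    "good_words.txt",
    "bad_words.txt",
    "project*-comments.html",
    "tei2html.config",
    "Processed",
]
PROCESSED = [
    "*.html",
    "*.xml",
    "*.txt",        # also matches the -utf8.txt variant
    "*-utf8.txt",
    "images",
    "images@1",
    "images@2",
]


def inventory(repo_dir):
    """Map each documented pattern to the sorted entry names that match it."""
    repo = Path(repo_dir)
    found = {}
    for pattern in TOP_LEVEL:
        found[pattern] = sorted(p.name for p in repo.glob(pattern))
    processed = repo / "Processed"
    for pattern in PROCESSED:
        matches = processed.glob(pattern) if processed.is_dir() else []
        found["Processed/" + pattern] = sorted(p.name for p in matches)
    return found
```

Because some items are optional, an empty list for a pattern does not by itself indicate a problem; the result is meant as an overview of what a given checkout contains.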

The source file can be processed with the tooling available at https://github.com/jhellingman/tei2html.

Full instructions on how to use the scripts can be found in that repository.

Pinned

  1. Information Public

    Some general information that applies to all repositories in GutenbergSource.

    Perl

Repositories

Showing 10 of 644 repositories
  • 11167-Kincaid-Deccan-Nursery-Tales Public

    TEI source files of Charles Augustus Kincaid (1870–1954): Deccan Nursery Tales; or, Fairy Tales from the South.

    HTML · Updated Jan 1, 2025
  • 20502-Ginkel-De-Groote-Pyramide Public

    TEI source files of Hendricus Johannes van Ginkel (1880–1954): De Groote Pyramide

    HTML · Updated Jan 1, 2025
  • 28577-Barrows-The-Negrito-and-Allied-Types Public

    TEI source files of David Prescott Barrows (1873–1954): The Negrito and Allied Types in the Philippines and The Ilongot or Ibilao of Luzon.

    HTML · Updated Jan 1, 2025
  • 38269-Barrows-A-History-of-the-Philippines Public

    TEI source files of David Prescott Barrows (1873–1954): A History of the Philippines.

    HTML · Updated Jan 1, 2025
  • 75000-O-Connor-Folk-tales-from-Tibet Public

    TEI source files of William Frederick Travers O’Connor (1870–1943): Folk tales from Tibet.

    HTML · Updated Dec 31, 2024
  • 74897-Davis-Chinese-fables-and-folk-stories Public

    TEI source files of Mary Hayes Davis (c. 1884–1948), Chow Leung: Chinese fables and folk stories.

    HTML · Updated Dec 16, 2024
  • 74894-Aristoteles-Zielkunde Public

    TEI source files of Aristoteles (384–322 BC): Aristoteles’ Zielenkunde.

    HTML · Updated Dec 16, 2024
  • 74781-De-Wit-De-drie-vrouwen-in-het-heilige-woud Public

    TEI source files of Augusta de Wit (1864–1939): De drie vrouwen in het heilige woud.

    HTML · Updated Dec 16, 2024
  • 74724-Tennyson-Henoch-Arden Public

    TEI source files of Alfred Tennyson (1809–1892): Henoch Arden

    HTML · Updated Nov 15, 2024
  • 74613-Dickens-Dombey-en-Zoon Public

    TEI source files of Charles Dickens (1812–1870): Dombey en Zoon.

    HTML · Updated Nov 15, 2024
