Note: This page is for system administrators. To read texts online, please see the Collections/Texts page.
Update May 20, 2016: Please note that we consider the source code deprecated and cannot offer customized installation support. We are working towards a significant site restructuring and will post more updates in the coming months.
Update May 27, 2011: A new release of the source code has been added to SourceForge. New texts have also been added to the data. If you want to use the latest source code, you may also want to get the latest data from below.
Perseus' Java Hopper
Source Code - The source code can be downloaded from SourceForge.net.
Text Files - These are the original XML text files. Download these files if you are generating the data for the hopper yourself. Texts are licensed under the Creative Commons ShareAlike 3.0 License.
- Download all texts (447 MB)
- Download individual collections of texts. NOTE: If you download individual collections, place the downloaded directories in /sgml/texts/. Some files may be duplicated across the different collections.
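Because some files appear in more than one collection, it can help to merge the downloads with a script that reports overlaps rather than copying blindly. The following is a minimal Python sketch, not part of the hopper distribution. The function name merge_collection, the report keys, and the assumption that each download has already been unpacked to a local folder whose subdirectories belong in /sgml/texts/ are all illustrative.

```python
import filecmp
import shutil
from pathlib import Path


def merge_collection(src: Path, dest: Path) -> dict:
    """Copy every file under src into dest, keeping relative paths.

    src  -- folder holding one unpacked collection (hypothetical layout)
    dest -- the texts directory, e.g. Path("/sgml/texts")
    """
    report = {"copied": [], "duplicate": [], "conflict": []}
    for path in sorted(src.rglob("*")):
        if not path.is_file():
            continue
        target = dest / path.relative_to(src)
        if not target.exists():
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.copy2(path, target)
            report["copied"].append(str(target))
        elif filecmp.cmp(path, target, shallow=False):
            # Same bytes already present from another collection: skip it.
            report["duplicate"].append(str(target))
        else:
            # Same path, different content: leave the existing file alone.
            report["conflict"].append(str(target))
    return report
```

Calling merge_collection once per unpacked collection leaves identical duplicates untouched and lists any same-named files whose contents differ, so they can be checked by hand.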
Data - Download these .tar.gz files if you prefer to use the provided database dumps and other generated data.
- Individual MySQL dumps:
- hib_artifact_keywords (132 KB)
- hib_artifacts (340 KB)
- hib_atomic_artifacts (398 KB)
- hib_building_artifacts (74 KB)
- hib_chunks (166 MB)
- hib_citations (5.3 MB)
- hib_coin_artifacts (101 KB)
- hib_date_ranges (17 KB)
- hib_dates (300 KB)
- hib_entities (20 MB)
- hib_entity_occurrences (53 MB)
- hib_frequencies (559 MB)
- hib_gem_artifacts (1.9 KB)
- hib_image_names (584 KB)
- hib_images (2.5 MB)
- hib_lang_abbrevs (948 bytes)
- hib_languages (986 bytes)
- hib_lemmas (2.2 MB)
- hib_parses (13 MB)
- hib_person_names (3.6 MB)
- hib_places (11 MB)
- hib_sculpture_artifacts (377 KB)
- hib_site_artifacts (72 KB)
- hib_toc_chunks (8.2 MB)
- hib_tocs (43 KB)
- hib_vase_artifacts (1.4 MB)
- hib_word_counts (54 KB)
- metadata (412 KB)
- morph_frequencies (93 KB)
- morph_votes (14.9 MB)
- prior_frequencies (4.6 MB)
- sense_votes (7.3 MB)
- senses (2.4 MB)
- Download processed XML texts and cache files. These directories go in /sgml/xml/.
- Download Lucene indexes (229 MB). This directory goes in /sgml/reading/.
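The placement steps above can be scripted. The following is a minimal Python sketch under stated assumptions: the function names extract_archive and load_dump are illustrative, and it assumes that each MySQL archive unpacks to a plain SQL file and that a mysql command-line client is installed and already configured with credentials. None of this is confirmed by the hopper distribution itself.

```python
import subprocess
import tarfile
from pathlib import Path


def extract_archive(archive: Path, dest: Path) -> list[str]:
    """Unpack a .tar.gz archive under dest and return the member names."""
    dest.mkdir(parents=True, exist_ok=True)
    root = dest.resolve()
    with tarfile.open(archive, "r:gz") as tar:
        members = tar.getmembers()
        for member in members:
            # Refuse entries that would land outside dest (e.g. "../x").
            target = (root / member.name).resolve()
            if not target.is_relative_to(root):
                raise ValueError(f"unsafe path in archive: {member.name}")
        tar.extractall(dest, members=members)
    return [m.name for m in members]


def load_dump(sql_file: Path, database: str) -> None:
    """Feed one SQL dump to the mysql client, like `mysql db < file`.

    Assumes sql_file is plain SQL and that database already exists.
    """
    with sql_file.open("rb") as fh:
        subprocess.run(["mysql", database], stdin=fh, check=True)
```

For example, with a hypothetical archive name, extract_archive(Path("lucene.tar.gz"), Path("/sgml/reading")) would unpack the Lucene indexes into place, and load_dump could then be called once per unpacked SQL file with whatever database name the hopper is configured to use.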
Perseus' Art & Archaeology Module
The Art & Archaeology data and source code are now included with the Perseus hopper.