How a solo archival system gets built in public
Updated: 2026-04-13
Completed so far
scripts/collector.py
- HISTORY_DIR
- EVENTS_FILE
- PARSER_VERSION = "2.3"
- ensure_dirs() now creates csj_history/
- stable_json_dumps()
- normalize_text_for_diff()
- normalize_list_for_diff()
- build_comparable_record()
- hash_value()
- compute_field_hashes()
- compute_content_hash()
- diff_job_records()
- OPERATIONAL_FIELDS
- CRITICAL_FIELDS
- TEXT_FIELDS
- normalize_job() to include:
  - parser_version
  - lifecycle_status
  - first_missing_at
  - consecutive_missing_runs
  - last_changed_at
  - field_hashes
  - content_hash
  - existing_record instead of only first_seen

Validated
python3 ~/.hermes/skills/research/civil-service-jobs-collector/scripts/collector.py --details -w 1 -n 1 --full --force
- parser_version: "2.3"
- lifecycle_status: "active"
- field_hashes
- content_hash

Not yet implemented
- csj_history/csj_events.jsonl

Recommended next step
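
One way the helpers listed above could feed the event log is sketched below. This is a minimal sketch, not the code in scripts/collector.py: the function names come from the list above, but their bodies and signatures, the contents of OPERATIONAL_FIELDS, the event fields, the append_event() helper, and the sample records are all assumptions made for illustration.

```python
import hashlib
import json
import os

# Assumed contents; the real OPERATIONAL_FIELDS in collector.py may differ.
OPERATIONAL_FIELDS = {"last_seen", "parser_version", "field_hashes", "content_hash"}


def stable_json_dumps(value):
    """Serialize with sorted keys and fixed separators so equal data gives equal text."""
    return json.dumps(value, sort_keys=True, separators=(",", ":"), ensure_ascii=False)


def hash_value(value):
    """Return the SHA-256 hex digest of a value's stable JSON form."""
    return hashlib.sha256(stable_json_dumps(value).encode("utf-8")).hexdigest()


def compute_field_hashes(record):
    """Hash each content field separately, skipping operational bookkeeping fields."""
    return {k: hash_value(v) for k, v in record.items() if k not in OPERATIONAL_FIELDS}


def compute_content_hash(field_hashes):
    """Collapse the per-field hashes into one hash for the whole record."""
    return hash_value(field_hashes)


def diff_job_records(old, new):
    """Return the sorted names of content fields whose hashes differ."""
    old_h, new_h = compute_field_hashes(old), compute_field_hashes(new)
    return sorted(k for k in old_h.keys() | new_h.keys() if old_h.get(k) != new_h.get(k))


def append_event(events_path, event):
    """Hypothetical helper: append one event as a single JSON line."""
    parent = os.path.dirname(events_path)
    if parent:
        os.makedirs(parent, exist_ok=True)
    with open(events_path, "a", encoding="utf-8") as fh:
        fh.write(stable_json_dumps(event) + "\n")


# Made-up sample records: only the salary changes between runs.
old = {"title": "Policy Adviser", "salary": "40000", "last_seen": "2026-04-12"}
new = {"title": "Policy Adviser", "salary": "42000", "last_seen": "2026-04-13"}
changed = diff_job_records(old, new)
```

Because last_seen is treated as operational in this sketch, it does not count as a change, so a run that only refreshes timestamps would write no event. A caller would then pass something like {"type": "changed", "fields": changed} to append_event() with the csj_history/csj_events.jsonl path whenever changed is non-empty.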