From scanned page to usable knowledge
Our digital-humanities workflow connects document scanning and OCR, text correction, structured metadata, full-text indexing, evidence-based retrieval, maps, knowledge cards and public reference pages.
The aim is not to replace close reading. It is to make large and difficult collections easier to navigate while preserving citations, uncertainty, and the distinction between source text and interpretation.
Current areas of work include historical Chinese phonology, Chinese character reference, the Siku Quanshu catalogue tradition, local gazetteers, dialect geography and endangered-language materials.
Public projects
Chinese Dialect Maps: https://dialect-maps.pages.dev/
Selected interactive maps turn fieldwork and dialect-reading data into geographical research interfaces.
Gazetteer Knowledge Base: https://gazetteer-wiki.pages.dev/
A curated public reference for selected places, people and texts found in local gazetteers.
Principles: provenance before automation; searchable evidence rather than unsupported summaries; clear separation of source, interpretation and hypothesis; public access to reviewed outputs without exposing restricted source collections.