The Humanistic Buddhism Corpus v1.0 is a high-quality dataset containing 81,000 Chinese-English parallel phrases. For details about the HBC, please read the 2024 LREC-COLING conference paper titled Humanistic Buddhism Corpus: A Challenging Domain-Specific Dataset of English Translations for Classical and Modern Chinese.
Researchers may use this corpus under the Creative Commons license agreement: Attribution NonCommercial-ShareAlike CC BY-NC-SA. Go to Download HBC to fill out the agreement form. Information on how to download the corpus will be sent to you within 7 days after your submission of the form.