Publications

JAPAS: A Benchmark and Neural Approach for Japanese Patent Support Relation Extraction

Efficient analysis of patent literature is crucial for technological development and protecting intellectual property. A key task is verifying the “support requirement,” which mandates that the detailed description must fully describe the claimed invention. This requirement is fundamental to a patent’s validity. Manual verification is a labor-intensive process that demands technical and legal expertise, making automation highly desirable. However, research on this task has been hampered by two key challenges: (1) the absence of a public benchmark, and (2) the reliance of prior work on lexical matching, which fails to capture semantic equivalence. To address these issues, we introduce JAPAS, the first public benchmark for this task, comprising over 2,000 instances manually annotated for Japanese patents. Each instance is labeled with a claim span, a supporting description paragraph, a relation type, and the annotator’s confidence level. Using this benchmark, we also establish modern baselines that capture semantic similarity, such as embeddings and LLMs. Our experiments show that a fine-tuned Qwen3-14B model achieves an F1 score of 0.50, outperforming the conventional lexical-based baseline. This result, which demonstrates that the task is feasible yet challenging, highlights the utility of JAPAS as a research foundation and provides a performance target for future work.

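Improving the Quality of Generated Sentences in Split and Rephrase Using Natural Language Inference and a Reconstructor
JParaCrawl v3.0: A Large-Scale Japanese-English Parallel Corpus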

Committee Special Award

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP
Incorporating Noisy Length Constraints into Transformer with Length-aware Positional Encodings
Bilingual Text Extraction as Reading Comprehension
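Simultaneous Speech-to-Speech Translation Based on Incremental Speech Recognition, Machine Translation, and Text-to-Speech Synthesis
Simultaneous Speech-to-Speech Translation Based on Incremental Speech Recognition, Machine Translation, and Text-to-Speech Synthesis
A Study of English-Japanese Neural Machine Translation Using Output Length Control with Positional Encoding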
Simultaneous Neural Machine Translation using Connectionist Temporal Classification
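Simultaneous Speech-to-Speech Translation Based on Incremental Speech Recognition, Machine Translation, and Text-to-Speech Synthesis
Neural Machine Translation Using Connectionist Temporal Classification for English-Japanese Simultaneous Translation
A Study of Neural Machine Translation for English-Japanese Simultaneous Interpretation
Training Neural Machine Translation with an Error Based on Distributed Word Representations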
Training Neural Machine Translation using Word Embedding-based Loss
Image Caption Selection by Transfer Learning
Speaker Recognition Using Deep Learning
Speaker Recognition Using Deep Learning