Inkuntri
Japanese Research, tools & pedagogy

A Research Stack for Japanese Learners: Corpora, Dictionaries, White Papers, Archives

The reader can assemble a Japanese research stack using corpora, dictionaries, official white papers, archives, news databases, and domain sources.

Published January 7, 2026 Japanese

Core examples: コーパス, 辞書, 白書, 統計, 公文書, アーカイブ, 判例, ニュースデータベース, 用例, 出典, 検索語, 信頼性.

Serious learners need sources, not just search results

A learner wants to know whether:

ご確認いただけますと幸いです

is natural. They search the web and find examples. But where are those examples from? Business emails? Templates? Machine-translated pages? Learner blogs? Official documents? Spam?

Another learner wants to understand:

関係人口

They find social media posts and travel ads, but miss municipal and policy usage.

A research stack solves this problem.

The key principle is:

Advanced Japanese learning depends on source discipline.

You need different tools for different questions.

コーパス

コーパス

corpus.

Use for:

  • usage examples,
  • frequency,
  • collocation,
  • genre comparison,
  • spoken/written contrast,
  • register checks.

Good questions:

What verbs co-occur with 対策? Is this phrase common in newspapers or conversation? What particles follow this expression?

Bad question:

What does this word mean in all contexts?

A corpus gives evidence, not final interpretation.

辞書

辞書

dictionary.

Use several types:

  • 国語辞典,
  • 和英辞典,
  • 漢和辞典,
  • 類語辞典,
  • アクセント辞典,
  • 専門辞典.

Good questions:

What are the senses? What is the reading? What are near-synonyms? Is this pitch pattern heiban or atamadaka?

Bad question:

Which English word should I always use?

白書

白書

white paper.

Use for:

  • policy vocabulary,
  • official framing,
  • statistics,
  • government categories,
  • current/social issues,
  • institutional language.

Examples:

防衛白書 defense white paper

労働経済白書 labor economy white paper

環境白書 environment white paper

Learner action: white papers are official views, not neutral reality.

統計

統計

statistics.

Use official statistics for:

  • population,
  • labor,
  • economy,
  • education,
  • health,
  • tourism,
  • surveys.

Good questions:

What metric is used? What is the unit? What is the comparison period? Is it seasonally adjusted?

Statistics teach data Japanese and caution.

公文書

公文書

official/public documents.

Use for:

  • administrative terms,
  • forms,
  • laws,
  • procedures,
  • public notices,
  • historical records.

Good questions:

How does the government phrase this? What is the official document name? What action is required?

Learner action: official documents are excellent for procedural Japanese.

アーカイブ

アーカイブ

archive.

Use for:

  • historical newspapers,
  • old documents,
  • public records,
  • language change,
  • media history,
  • old spellings,
  • institutional memory.

Archives help when you need historical evidence, not only modern usage.

判例

判例

court precedent/case law.

Use for legal Japanese research with caution.

Good for:

  • legal terms,
  • court phrasing,
  • issue/holding structure,
  • party roles,
  • statutory interpretation.

Caution: not for casual learner interpretation in real disputes.

Learner action: read legal materials as language only unless professionally qualified.

ニュースデータベース

ニュースデータベース

news database.

Use for:

  • headline patterns,
  • current vocabulary,
  • discourse over time,
  • source comparison,
  • public event tracking.

News databases are better than random search when you need traceable journalism.

用例 and 出典

用例

usage example.

出典

source.

A serious note should include both.

Weak note:

周知 = make known.

Strong note:

周知する: to inform/make known to relevant people; common in official/workplace contexts. Example source: municipal notice, school announcement, company compliance email.

Learner action: record source context.

検索語

検索語

search term.

Good Japanese research depends on good search terms.

For a topic like child allowance, search:

子育て支援 給付金 申請 対象者 必要書類

not:

Japanese child money paper

Use Japanese terms from the domain.

信頼性

信頼性

reliability/credibility.

Evaluate sources by:

  • authority,
  • genre,
  • date,
  • purpose,
  • expertise,
  • citation,
  • domain,
  • bias,
  • whether it is original or copied,
  • whether it is user-generated.

Learner action: reliability depends on question. A social-media post is good evidence for slang, weak evidence for law.

Research stack by question

QuestionBest source
What does this word mean?dictionary
How is it used?corpus/examples
Is it formal or casual?corpus by genre + examples
What is official wording?government/official documents
What is policy framing?white papers
What is current public reporting?news database
What is legal usage?statutes, cases, legal dictionaries
What is historical use?archives
What is technical meaning?specialist glossary/docs
What is natural in conversation?spoken corpus/transcripts

Scenario 1: checking a business phrase

Question:

Is ご確認いただけますと幸いです natural?

Use:

  1. business email examples,
  2. corpus examples,
  3. style guides,
  4. compare with ご確認ください and ご確認お願いいたします.

Record:

  • formal written request,
  • soft/burden-sensitive,
  • may be too stiff for casual chat.

Scenario 2: policy term

Question:

What is 関係人口?

Use:

  1. white papers,
  2. municipal pages,
  3. regional revitalization reports,
  4. news features.

Record:

  • related population,
  • policy term between resident and visitor,
  • used in rural revitalization.

Scenario 3: kanji compound

Question:

How to understand 適合性?

Use:

  1. dictionary,
  2. technical standards examples,
  3. JIS/compliance pages,
  4. collocation search.

Record:

  • conformity/suitability depending domain,
  • common with 基準, 規格, 検査.

Example bank walkthrough

コーパス

Corpus.

Learner action: usage evidence.

辞書

Dictionary.

Learner action: meaning and form.

白書

White paper.

Learner action: official policy framing.

統計

Statistics.

Learner action: data and metrics.

公文書

Official document.

Learner action: institutional wording.

アーカイブ

Archive.

Learner action: historical evidence.

判例

Case law.

Learner action: legal-language source, caution.

ニュースデータベース

News database.

Learner action: journalistic usage.

用例

Usage example.

Learner action: context.

出典

Source.

Learner action: evidence tracking.

検索語

Search term.

Learner action: Japanese query design.

信頼性

Reliability.

Learner action: source evaluation.

Research workflow

When researching Japanese usage:

  1. Define the question.
  2. Choose source type.
  3. Search using Japanese terms.
  4. Check multiple examples.
  5. Record source and genre.
  6. Compare dictionary and corpus.
  7. Check official/domain source if needed.
  8. Mark reliability.
  9. Write a plain conclusion.
  10. Record uncertainty.

Source stack matching table

Choose tools by question.

QuestionSource stack
What does it mean?dictionary + examples
Is it natural?corpus + native examples
Is it formal?genre comparison
Is it official wording?government/official documents
Is it policy language?white papers + statistics
Is it current news?news database
Is it legal usage?statute/case/legal dictionary
Is it historical?archive
Is it technical?specialist glossary/docs
Is it spoken?spoken corpus/transcripts

Source choice is part of the answer.

Reliability ladder

SourceBest forCaution
official documentinstitutional wordingofficial framing
white paperpolicy vocabularygovernment perspective
corpususage evidencecorpus composition
dictionarymeaning/sensegloss limits
news databasepublic reportingeditorial framing
archivehistorical evidenceold usage
social mediaslang/stanceunreliable for facts
AI/MT outputcomparison aidmust verify

No source is universally best.

Research note template

For each researched term, record:

question search terms sources checked example sentence conclusion register/domain uncertainty next source to check

This prevents “I saw it online” from becoming false confidence.

A strong tool for this article would match questions to source types.

Suggested functions:

  1. Question-type selector.
  2. Recommended source stack.
  3. Search-term builder.
  4. Source reliability checklist.
  5. Example/source note fields.
  6. Domain caution flags.
  7. Conclusion with uncertainty field.

Final rule

Advanced Japanese learning needs evidence.

コーパス gives usage. 辞書 gives definitions. 白書 gives policy framing. 統計 gives data. 公文書 gives official language. アーカイブ gives history. 判例 gives legal phrasing. ニュースデータベース gives public reporting. 用例 and 出典 keep you honest.

Do not just search. Research.

Related reading