Phase 2: Translation

Osho's Ashtavakra Gita Project

AshtavakraOsho

A structured, AI-ready dataset of Ashtavakra MahaGita teachings based on the discourses of Osho.

Designed for semantic search, AI/LLM applications, knowledge graph construction, and bilingual study.

About the Project

This project transforms the raw Hindi text of Osho's discourses on the Ashtavakra MahaGita into a clean, structured corpus suitable for modern AI applications.

The current focus is on building a robust data foundation layer, ensuring high-quality chunking and consistency before moving on to the translation and AI layers.

{
  "id": "AAG_C01_P001",
  "chapter_no": 1,
  "chunk_index": 1,
  "text_hi": "जनक उवाच...",
  "text_en": "Janaka said...",
  "translation_status": "ai_draft"
}
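
A record like the one above can be checked before it enters the corpus. The following is a minimal sketch, not the project's actual loader: the `Chunk` class, the `parse_chunk` helper, and the ID layout (prefix `AAG`, two-digit chapter, three-digit chunk index) are assumptions inferred from the single sample record.

```python
import json
import re
from dataclasses import dataclass

# Assumed ID layout, inferred from the sample "AAG_C01_P001":
# fixed prefix, two-digit chapter number, three-digit chunk index.
ID_PATTERN = re.compile(r"^AAG_C(\d{2})_P(\d{3})$")


@dataclass
class Chunk:
    """One record of the corpus, mirroring the fields of the sample JSON."""
    id: str
    chapter_no: int
    chunk_index: int
    text_hi: str
    text_en: str
    translation_status: str


def parse_chunk(raw: str) -> Chunk:
    """Parse one JSON record and check that its ID agrees with its numeric fields."""
    # Chunk(**...) raises TypeError if a field is missing or unexpected.
    chunk = Chunk(**json.loads(raw))
    match = ID_PATTERN.match(chunk.id)
    if match is None:
        raise ValueError(f"unexpected id format: {chunk.id!r}")
    chapter, index = int(match.group(1)), int(match.group(2))
    if (chapter, index) != (chunk.chapter_no, chunk.chunk_index):
        raise ValueError(f"id {chunk.id!r} disagrees with chapter_no/chunk_index")
    return chunk


record = json.dumps({
    "id": "AAG_C01_P001",
    "chapter_no": 1,
    "chunk_index": 1,
    "text_hi": "जनक उवाच...",
    "text_en": "Janaka said...",
    "translation_status": "ai_draft",
}, ensure_ascii=False)

chunk = parse_chunk(record)
print(chunk.id, chunk.translation_status)
```

Cross-checking the ID against `chapter_no` and `chunk_index` is one way to catch inconsistent records early, before they reach translation or embedding.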

91 Lessons

Dive into the profound wisdom of Ashtavakra MahaGita through 91 transformative chapters


Features

Semantic Search

Find teachings instantly with AI-powered search across all chapters

Bilingual Reading

Study in Hindi and English with side-by-side translations

RAG Ready

Structured data optimized for AI and LLM applications

Open Source

Free for everyone: researchers, developers, and seekers

Roadmap

Raw Hindi text ingestion
Text normalization
Sentence-safe chunking
Structured JSON generation
AI-assisted translation
Embeddings & semantic search
Web reader interface
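
The "sentence-safe chunking" step in the roadmap can be illustrated with a short sketch. The project's actual chunker is not shown here, so this is only one possible approach under stated assumptions: sentences are taken to end at a danda (।), double danda (॥), question mark, or exclamation mark, and the function name `chunk_sentences` and the `max_chars` limit are hypothetical.

```python
import re

# Assumed sentence boundary: whitespace that follows a danda (।),
# double danda (॥), question mark, or exclamation mark.
SENTENCE_END = re.compile(r"(?<=[।॥?!])\s+")


def chunk_sentences(text: str, max_chars: int = 500) -> list[str]:
    """Pack whole sentences into chunks of at most max_chars where possible.

    A single sentence longer than max_chars is kept intact as its own
    chunk, so no sentence is ever cut in the middle.
    """
    sentences = [s.strip() for s in SENTENCE_END.split(text) if s.strip()]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


sample = "जनक उवाच। ज्ञान कैसे मिले? मुक्ति कैसे हो।"
for piece in chunk_sentences(sample, max_chars=20):
    print(piece)
```

Length is measured in Unicode code points rather than tokens, which keeps the sketch dependency-free; a token-based limit may suit embedding models better.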

Architecture Vision

Consistency at the data layer is critical for everything downstream.

RAW TEXT → DATA → EMBED → AI