Changelog
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
[Unreleased]
Changed
API key environment variable: Updated all code to use
LLM_API_KEYinstead ofOPENAI_API_KEYfor consistency with repository secrets configuration. This affects:src/honegumi_rag_assistant/app_config.py- Settings class now reads fromLLM_API_KEYsrc/honegumi_rag_assistant/nodes/code_writer.py- Error message updatedsrc/honegumi_rag_assistant/extractors.py- Error message updatedsrc/honegumi_rag_assistant/build_vector_store.py- Environment variable check updatedscripts/test_vector_store.py- Environment variable check updatedscripts/run_rag_experiments.py- Documentation comments updated
Added
Two-stage parameter extraction with solution-format structure: Implemented chain-of-thought reasoning for grid parameter selection. Stage 1 extracts explicit problem structure using
ProblemStructureExtractorin the same format as test problem solutions (search_space, objective, budget, batch_size, noise_model, constraints). Stage 2 uses this structured representation to make grid selections via enhancedParameterExtractor. This approach ensures consistency with validation expectations and improves accuracy through explicit reasoning.Solution-format Pydantic models: Added
SearchSpaceParameter,ObjectiveSpec,ConstraintSpec, andProblemStructuremodels that mirror the solution structure from test problems, ensuring perfect alignment between extraction and validation.Enhanced debug output: Parameter selector now shows both Stage 1 (problem structure in solution format) and Stage 2 (grid selections) with clear formatting for better transparency and debugging.
Detailed grid selection rules: Enhanced prompt includes explicit lookup table mapping extracted structure to grid parameters with special emphasis on constraint type distinctions.
Problem statement collection: Created
data/raw/problem_statements.yamlwith initial problem statement for ceramic sintering optimization. Includes natural version (underspecified), corrected version (conversational with density units), Honegumi-specific grid selections (objective, model, task, constraints), links to relevant Honegumi tutorials (SOBO, Batch Fully Bayesian), and references to related Ax repository issues. Template for collecting 100 problem statements across different personas in physical sciences.RAG experiment infrastructure: Created
data/raw/rag_assistant_runs.yamlfor tracking RAG assistant experiments with experiment IDs, prompts, results, and log correlations. Addedscripts/run_rag_experiments.pyto run experiments programmatically and capture intermediate grid selections, generated scripts, and terminal logs. Addedscripts/run_experiments_and_upload.shbash wrapper. Addeddata/raw/README_RAG_EXPERIMENTS.mddocumenting experiment structure and artifact locations. Updated to work with GitHub Actions environment secrets.
Changed
ProblemStructure schema: Now follows exact solution format with
search_space(list of parameters),objective(list of objectives),budget,batch_size,noise_model,constraints,historical_data_points, andmodel_preference- matching test problem structure.Constraint distinction: Clear separation between
composition_constraint(fractions sum to 1.0 for materials) andsum_constraint(general sum constraints) with explicit total value tracking viatotalfield instead oftarget_value.Parameter types: Simplified to
continuousandcategorical(removedinteger) to match solution format expectations.ParameterExtractor prompt: Includes formatted table showing grid selection logic with pre-computed boolean flags for each constraint type based on extracted structure, making the mapping explicit and deterministic.
Parameter selector workflow: Two-stage extraction (structure → grid) with solution-format intermediate representation instead of direct natural language → grid mapping.
Fixed
Removed API key existence checks from experiment runner scripts - secrets are available in GitHub Actions runtime but not in Copilot agent sandbox.
[0.1.7] - 2025-10-20
Added
Settings reload method: Added
settings.reload_from_env()method to allow reloading configuration from environment variables after module import. Fixes Colab/Jupyter notebook issue where settings were cached before environment variables were set.
Changed
Colab tutorial improvements: Updated Step 5 to explicitly call
settings.reload_from_env()and print confirmation of vector store path. Ensures vector store is properly detected in notebooks.Build script error message: Updated orchestrator error message to use
python -m honegumi_rag_assistant.build_vector_storeinstead of outdatedscripts/build_vector_store.pypath.
[0.1.6] - 2025-10-20
Fixed
Vector store Ax version pinning:
src/honegumi_rag_assistant/build_vector_store.pynow defaults to cloning Ax v0.4.3 (matching honegumi dependency) instead of latest main branch. This prevents documentation version mismatch where LLM receives v1.x docs but generates v0.4.3 code. New--ax-versionparameter allows specifying different versions when needed. Addresses issue raised in PR #3.Vector store missing condition now properly detected when path is configured but directory doesn’t exist
Changed
Build script location: Moved
build_vector_store.pyfromscripts/tosrc/honegumi_rag_assistant/to make it available as a module for pip-installed users. Usage:python -m honegumi_rag_assistant.build_vector_storeRetrieval UX improvements: Simplified retrieval output to show only essential information. In non-debug mode, displays “Planning X parallel retrievals…” followed by success message with context count and timing, or clear error if vector store missing. Removes verbose query details and individual retriever timings from default output.
Vector store metadata: Build script now saves
metadata.jsonwith build details including Ax version, build date, chunk configuration, and document counts for better provenance tracking.Removed verbose per-query retrieval details from non-debug output
Cleaner success message: “✓ Retrieved X contexts in Y.XXs”
Added
Clear warning when vector store is not found: prompts user to build it
Vector store path existence check before attempting retrieval
[0.1.5] - 2025-10-20
Changed
Simplified ReadTheDocs navigation structure (removed unnecessary sections)
Reduced documentation table of contents depth from 2 to 1 for cleaner appearance
Removed License and Authors pages from main navigation
Documentation
Added interactive CLI usage example showing the prompt and user input flow
Improved Quick Start section with clearer instructions
Renamed “Contents” to “Documentation” for better clarity
[0.1.4] - 2025-10-20
Added
Project logo in ReadTheDocs sidebar
ReadTheDocs badge to README.md tracking stable version
Enhanced ReadTheDocs theme options (version display, external link styling)
Changed
Updated project name from “honegumi_rag_assistant” to “Honegumi RAG Assistant” in documentation
Replaced Features section with Key Capabilities from README in docs/index.md
Added project overview and description to documentation landing page
Updated README.md badge to track stable version instead of latest
Improved documentation landing page with logo and better structure
Documentation
Set ReadTheDocs default version to
stablefor production useConfigured logo to appear in documentation sidebar
Enhanced theme configuration for better user experience
[0.1.3] - 2025-10-20
Added
Moved
build_vector_store.pyfromscripts/tosrc/honegumi_rag_assistant/for pip package accessibilityConfigured PyPI to use
README_PYPI.mdfor concise package descriptionReadTheDocs configuration with Google-style docstring support
Sphinx ReadTheDocs theme
Napoleon extension configured for Google docstrings
Comprehensive documentation structure
Changed
Updated
setup.cfgto useREADME_PYPI.mdas long_descriptionEnhanced
docs/index.mdwith project description and featuresUpdated
docs/requirements.txtwith all package dependenciesChanged documentation theme from Alabaster to ReadTheDocs
[0.1.1] - 2025-10-20
Added
Google Colab tutorial notebook (
notebooks/honegumi_rag_colab_tutorial.ipynb)README_PYPI.mdfor PyPI package pagePyPI badges to
README.md(Colab, Issues, Discussions, Last Commit)“Google Colab Tutorial” section in README.md
“Feedback & Feature Requests” section in README.md
requirements.txtwith exact version pinning for reproducibility
Changed
Updated
setup.cfgwith all dependencies using exact versions (==)Added console script entry point:
honegumi-ragcommandEnhanced README.md with comprehensive installation and usage instructions
Colab notebook uses programmatic API only (CLI doesn’t work interactively)
Fixed
Package structure to include
build_vector_store.pyin pip installation
[0.1.0] - 2025-10-20
Added
Initial release of Honegumi RAG Assistant
Multi-agent LangGraph pipeline for code generation
RAG-based retrieval from Ax Platform documentation
FAISS vector store for fast document retrieval
Specialized agents: IssueScout, ParameterSelector, RetrievalPlanner, CodeWriter, Reviewer
CLI interface with
honegumi-ragcommandProgrammatic API via
run_from_text()andrun_from_dict()Debug mode for detailed execution logging
Optional code review and refinement step
Support for custom output directories
Changed
Made file saving optional (only when
--output-dirspecified)Removed all emoji characters from output for cleaner UX
Added visual spacing between user input and processing output
Moved startup banners and timing to debug-only mode
Technical
Built on Honegumi for deterministic skeleton generation
Uses OpenAI GPT models (configurable GPT-4o and GPT-o1)
Python 3.11+ required
Dependencies: langchain, langgraph, faiss-cpu, honegumi, openai