llm-arch-research

mirror of https://github.com/pese-git/llm-arch-research.git synced 2026-01-23 13:00:54 +00:00

Files

Sergey Penkovsky 25caf69ced refactor(gpt1): migrate Decoder to GptDecoder, unify API, and update tests

- Renamed Decoder (and decoder.py) to GptDecoder (gpt_decoder.py) for clarity in GPT1
- Implemented support for cache and use_cache parameters in GptDecoder.forward (API unification)
- Adapted all usages in GPT model to use new decoder structure and handle tuple output
- Refactored core tests (test_gpt.py, test_gpt_decoder.py, test_basic.py) to correctly expect tuple or logits and ensure shape/device checks work as before
- Improved clarity and future extensibility for autoregressive generation and benchmarking
- No changes to architectural details or training loop; pure API and test modernization

2025-10-22 16:27:08 +03:00

core

refactor(gpt1): migrate Decoder to GptDecoder, unify API, and update tests

2025-10-22 16:27:08 +03:00

datasets

doc(datasets): update docstrings and tests

2025-10-17 10:49:45 +03:00

models

refactor(gpt1): migrate Decoder to GptDecoder, unify API, and update tests

2025-10-22 16:27:08 +03:00

tokenizers

Рефакторинг: единообразие оформления кода (пробелы, кавычки, пустые строки), без изменения логики по всему проекту.