llm-arch-research

mirror of https://github.com/pese-git/llm-arch-research.git synced 2026-05-16 10:09:42 +00:00

Files

Sergey Penkovsky db0ab511d1 feat(gpt2): add Gpt2Decoder module, refactor model and add tests

- Implemented core/gpt2_decoder.py: GPT-2-style transformer decoder block with KV cache
- Refactored models/gpt/gpt2.py to use new Gpt2Decoder, improved documentation
- Added tests/core/test_gpt2_decoder.py for main features and cache
- Temporarily skipped HF proxy integration test for compatibility

2025-10-31 15:35:54 +03:00

generate_with_hf_tools.py

Refactoring: consistent code formatting (whitespace, quotes, blank lines) across the whole project, with no changes to logic.

2025-10-06 22:57:19 +03:00

simple_hf_training.py

2025-10-06 22:57:19 +03:00

test_hf_proxy.py

feat(gpt2): add Gpt2Decoder module, refactor model and add tests

2025-10-31 15:35:54 +03:00

train_with_hf_trainer.py

2025-10-06 22:57:19 +03:00