@thejhh thejhh commented May 24, 2025

…eturn input tokens for test setup to pass all inference tests. See #190.

@thejhh thejhh self-assigned this May 24, 2025
@thejhh thejhh added the bitnet BitNet implementation label May 24, 2025
Jaakko Heusala added 2 commits May 24, 2025 03:35
Add tensor operations for token decoding including softmax, matrix multiplication, and scaling. Implement efficient parallel processing for inference. Part of #190.
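As a rough illustration of the softmax step named in the commit above, here is a minimal sketch in Go. It is not the code from this pull request: the `softmax` and `argmax` names, the `[]float32` slices, and the greedy token pick are assumptions made only for this example, and the project's tensor types are not shown in this thread.

```go
package main

import (
	"fmt"
	"math"
)

// softmax converts raw logits into probabilities that sum to 1.
// Subtracting the largest logit first keeps math.Exp from overflowing.
// Hypothetical helper: not taken from the pull request.
func softmax(logits []float32) []float32 {
	if len(logits) == 0 {
		return nil
	}
	maxLogit := logits[0]
	for _, v := range logits[1:] {
		if v > maxLogit {
			maxLogit = v
		}
	}
	// Accumulate in float64 to limit rounding error in the normalizer.
	exps := make([]float64, len(logits))
	var sum float64
	for i, v := range logits {
		exps[i] = math.Exp(float64(v - maxLogit))
		sum += exps[i]
	}
	probs := make([]float32, len(logits))
	for i, e := range exps {
		probs[i] = float32(e / sum)
	}
	return probs
}

// argmax returns the index of the largest value, which corresponds to
// greedy decoding. It returns 0 for an empty slice.
func argmax(values []float32) int {
	best := 0
	for i, v := range values {
		if v > values[best] {
			best = i
		}
	}
	return best
}

func main() {
	probs := softmax([]float32{1.0, 2.0, 3.0})
	fmt.Println(probs, argmax(probs))
}
```

A real inference loop would apply this to the logits of the last position at every decoding step; sampling strategies other than greedy selection are equally possible.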
@thejhh thejhh linked an issue May 24, 2025 that may be closed by this pull request
Jaakko Heusala added 4 commits May 24, 2025 17:00
- Remove unused interfaces.go file
- Update attention and tensor operations for improved inference
- Enhance math operations and layer implementations
- Update tests and benchmarks for modified components
- Replace fmt.Errorf with static error values in attention_sublayer.go
- Add DebugLog calls for dynamic error details
- Fix tensor operation error handling in test files
- Improve test robustness with proper error checks
@thejhh thejhh force-pushed the 190-token-decoding-inference-loop branch from 37291f6 to 3a73135 Compare June 1, 2025 11:06

Labels

bitnet BitNet implementation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Token Decoding (Inference Loop)

2 participants