Add experimental attention quantization flags. by copybara-service[bot] · Pull Request #4321 · AI-Hypercomputer/maxtext

copybara-service · 2026-07-01T19:22:37Z

Add experimental attention quantization flags.

This adds two experimental flags to the splash attention config, one to
quantize Q and one to quantize K. Attention quantization is currently a research
project, so both flags should stay off by default. In the future we will support
more quantization options for the backward pass, RoPE vs. no-RoPE, more
dtypes, etc.
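
As a rough illustration of what such flags could gate, here is a minimal JAX sketch. It is not the code in this PR: the flag names `quantize_q` and `quantize_k`, the `AttentionQuantConfig` class, the int8 target, and the quantize-then-dequantize ("fake quant") approach are all assumptions made for the example, and it uses a plain einsum rather than the splash attention kernel.

```python
import dataclasses

import jax
import jax.numpy as jnp


@dataclasses.dataclass(frozen=True)
class AttentionQuantConfig:
  # Hypothetical flag names; the PR description does not list the real ones.
  # Both default to False, matching "turned off by default".
  quantize_q: bool = False
  quantize_k: bool = False


def fake_quant_int8(x):
  """Symmetric int8 quantize-dequantize along the last axis (assumed scheme)."""
  absmax = jnp.max(jnp.abs(x), axis=-1, keepdims=True)
  # Guard against all-zero vectors so the division below is well defined.
  scale = jnp.where(absmax > 0, absmax / 127.0, 1.0)
  q = jnp.clip(jnp.round(x / scale), -127, 127)
  return q * scale


def attention_logits(q, k, cfg):
  """Computes Q K^T, optionally fake-quantizing Q and/or K first."""
  if cfg.quantize_q:
    q = fake_quant_int8(q)
  if cfg.quantize_k:
    k = fake_quant_int8(k)
  return jnp.einsum("...qd,...kd->...qk", q, k)


key_q, key_k = jax.random.split(jax.random.PRNGKey(0))
q = jax.random.normal(key_q, (2, 4, 8))  # [heads, q_len, head_dim]
k = jax.random.normal(key_k, (2, 6, 8))  # [heads, kv_len, head_dim]

baseline = attention_logits(q, k, AttentionQuantConfig())
quantized = attention_logits(
    q, k, AttentionQuantConfig(quantize_q=True, quantize_k=True)
)
max_err = float(jnp.max(jnp.abs(baseline - quantized)))
```

With both flags left at their defaults the logits are unchanged, so `max_err` measures only the error introduced when Q and K are quantized.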

codecov · 2026-07-01T19:28:12Z

Codecov Report

❌ Patch coverage is 0% with 4 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/layers/attention_mla.py	0.00%	2 Missing and 2 partials ⚠️


PiperOrigin-RevId: 940686402

copybara-service[bot] force-pushed the test_940686402 branch from dd302f3 to 9850712 · 2026-07-01 19:30

copybara-service[bot] wants to merge 1 commit into main from test_940686402
