Canonicalize MX fake quant export through Q-DQ by mhs4670go · Pull Request #762 · Samsung/TICO

mhs4670go · 2026-06-06T09:19:50Z

Introduce a separate MX fake-quant frontend op and lower it to a logical quantize_mx/dequantize_mx pair before Circle export.

Related: #436
TICO-DCO-1.0-Signed-off-by: seongwoo mhs4670go@naver.com

Introduce a separate MX fake-quant frontend op and lower it to a logical quantize_mx/dequantize_mx pair before Circle export. TICO-DCO-1.0-Signed-off-by: seongwoo <mhs4670go@naver.com>

mhs4670go · 2026-06-06T09:28:34Z

@stamalakhov

Thanks for the draft. I slighty changed a bit the codes that introduce mx_fake_quantize.

Then DecomposeFakeQuantize lowers mx_fake_quantize to quantize_mx -> dequantize_mx, and FoldQuantOps folds that Q-DQ pair into the producer node's QPARAM metadata, just like affine quantization already does. This avoids overloading quantize_mx with two different meanings.

Please feel free to give your opinions.

stamalakhov · 2026-06-08T05:32:46Z

+def CircleMXFakeQuantize():
+    """Register the eager MX fake-quantization custom operator."""
+
+    @custom_op("circle_custom::mx_fake_quantize", mutates_args=())


stamalakhov · 2026-06-08T05:43:28Z

        # TODO Add more dtypes
    }
-
+    optional_dtypes = {


👍
Although i'm not sure why these:

"mxint8": "MXINT8", "mxfp4": "MXFP4",

can not be inserted directly into dmap.

Good point. I used the indirect getattr mapping only to avoid breaking environments where circle_schema does not define MX tensor types yet. In that case, non-MX dtype conversion can still work and MX dtype conversion fails only when requested.

However, since this PR is adding MX export support, it is reasonable to require a schema version that already has MXINT8/MXFP4. I agree direct entries in dmap are simpler and clearer. I will update it that way.

stamalakhov · 2026-06-08T06:12:19Z

@stamalakhov

Thanks for the draft. I slighty changed a bit the codes that introduce mx_fake_quantize.

Then DecomposeFakeQuantize lowers mx_fake_quantize to quantize_mx -> dequantize_mx, and FoldQuantOps folds that Q-DQ pair into the producer node's QPARAM metadata, just like affine quantization already does. This avoids overloading quantize_mx with two different meanings.

Please feel free to give your opinions.

@mhs4670go
I've tried to keep existing codes untouched. So that they can be refactored later. I planned to introduce Q-DQ after #760, I I supposed to introduce serialization of MX in another PR, as it is contained in another module. Draft contains all of these features. Should i approve #762?

mhs4670go · 2026-06-08T12:49:23Z

@stamalakhov

Thanks for the clarification. I understand your intention: #760 introduces the MX Q/DQ stubs first, and the Q-DQ canonicalization and serialization could be layered on top in later PRs. I'm sorry if my comment came across as dismissing your code. That was not my intention at all.

Even though it can be refactored later, my preference was let #762 supersede #760 if you are comfortable with the scope, because the fake-quant op, Q-DQ decomposition, and folding logic are tightly coupled and are easier to review as one coherent export flow.

From the design perspective, this keeps the MX export path consistent with the affine fake-quant path: fake quant API -> Q-DQ decomposition -> qparam folding into producer metadata -> Circle qmodel export.

If you're okay with it, I'd prefer to proceed directly on top of the refactored code, even if it may require a bit of extra work.

stamalakhov · 2026-06-08T12:54:23Z

If you're okay with it, I'd prefer to proceed directly on top of the refactored code, even if it may require a bit of extra work.

I'm ok with it.

stamalakhov

LGTM!

Canonicalize MX fake quant export through Q-DQ

7dc00e5

Introduce a separate MX fake-quant frontend op and lower it to a logical quantize_mx/dequantize_mx pair before Circle export. TICO-DCO-1.0-Signed-off-by: seongwoo <mhs4670go@naver.com>

mhs4670go force-pushed the mx branch from 09aa40f to 7dc00e5 Compare June 6, 2026 09:25

fix test.

c5123c7

mhs4670go force-pushed the mx branch from 1272b4e to c5123c7 Compare June 8, 2026 03:19

stamalakhov reviewed Jun 8, 2026

View reviewed changes

apply comment.

45af0c5

stamalakhov approved these changes Jun 8, 2026

View reviewed changes

mhs4670go merged commit f3098af into Samsung:main Jun 8, 2026
7 checks passed

mhs4670go deleted the mx branch June 8, 2026 13:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Canonicalize MX fake quant export through Q-DQ#762

Canonicalize MX fake quant export through Q-DQ#762
mhs4670go merged 3 commits into
Samsung:mainfrom
mhs4670go:mx

mhs4670go commented Jun 6, 2026

Uh oh!

mhs4670go commented Jun 6, 2026

Uh oh!

stamalakhov Jun 8, 2026

Uh oh!

stamalakhov Jun 8, 2026

Uh oh!

mhs4670go Jun 8, 2026

Uh oh!

stamalakhov commented Jun 8, 2026

Uh oh!

mhs4670go commented Jun 8, 2026

Uh oh!

stamalakhov commented Jun 8, 2026

Uh oh!

stamalakhov left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mhs4670go commented Jun 6, 2026

Uh oh!

mhs4670go commented Jun 6, 2026

Uh oh!

stamalakhov Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

stamalakhov Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

mhs4670go Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

stamalakhov commented Jun 8, 2026

Uh oh!

mhs4670go commented Jun 8, 2026

Uh oh!

stamalakhov commented Jun 8, 2026

Uh oh!

stamalakhov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants