From 61552e4450abc5924589a1916b1d7f63f02f151a Mon Sep 17 00:00:00 2001
From: Yamini Nimmagadda <yamini.nimmagadda@intel.com>
Date: Mon, 12 Jan 2026 17:37:26 -0800
Subject: [PATCH] Update OPENVINO.md

---
 docs/backend/OPENVINO.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/backend/OPENVINO.md b/docs/backend/OPENVINO.md
index 7c2e733b03..3395b70e60 100644
--- a/docs/backend/OPENVINO.md
+++ b/docs/backend/OPENVINO.md
@@ -52,7 +52,7 @@ Accuracy and performance optimizations for quantized models are still work in pr
 
 - **Primary supported quantization scheme is `Q4_0`**
 - `Q4_0` and `Q4_1` tensors are requantized to int4 gs128 symmetric
-- `Q6_K` tensors are requentized to int8 except for the token embedding matrix
+- `Q6_K` tensors are requentized to int8 except for the token embedding matrix which is dequantized to fp16
 
 #### Additional Notes