Hi Authors,
Thank you for your great work! I'd like to know whether FLAP supports KV cache reduction, rather than a speedup of the query/key computation.