
With Qwen-3-Max thinking, I remember inference becoming very slow as you pushed toward 1M context; the degradation was already noticeable around 300k tokens. But of course, I was using Qwen Chat, so it could be a resource allocation thing.

