Our technical report “Serving Large Language Models on Huawei CloudMatrix384” is now available on arXiv!