


Quanlu Zhang
2020 – today
- 2024
- [j4] Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang:
Efficient Large Language Models: A Survey. Trans. Mach. Learn. Res. 2024 (2024)
- [j3] Yang Li, Zhenhua Li, Zhenhua Han, Quanlu Zhang, Xiaobo Ma:
Automating Cloud Deployment for Real-Time Online Foundation Model Inference. IEEE/ACM Trans. Netw. 32(2): 1509-1523 (2024)
- [c25] Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei:
You Only Cache Once: Decoder-Decoder Architectures for Language Models. NeurIPS 2024
- [c24] Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang:
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. OSDI 2024: 307-323
- [c23] Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou:
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training. OSDI 2024: 347-363
- [i13] Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei:
You Only Cache Once: Decoder-Decoder Architectures for Language Models. CoRR abs/2405.05254 (2024)
- 2023
- [c22] Hanyu Zhao, Zhenhua Han, Zhi Yang, Quanlu Zhang, Mingxia Li, Fan Yang, Qianxi Zhang, Binyang Li, Yuqing Yang, Lili Qiu, Lintao Zhang, Lidong Zhou:
SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters. EuroSys 2023: 883-898
- [c21] Xudong Wang, Li Lyna Zhang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. ICCV 2023: 5796-5805
- [c20] Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. ICCV 2023: 5806-5817
- [c19] Bin Lin, Ningxin Zheng, Lei Wang, Shijie Cao, Lingxiao Ma, Quanlu Zhang, Yi Zhu, Ting Cao, Jilong Xue, Yuqing Yang, Fan Yang:
Efficient GPU Kernels for N:M-Sparse Weights in Deep Learning. MLSys 2023
- [c18] Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang, Zhenhua Han, Lingxiao Ma, Yuqing Yang, Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou:
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation. SOSP 2023: 331-347
- [i12] Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang, Fan Yang, Saeed Maleki, Yi Zhu, Xu Cao, Cheng Li, Mao Yang, Lintao Zhang, Lidong Zhou:
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction. CoRR abs/2301.08984 (2023)
- [i11] Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang, Zhenhua Han, Yuqing Yang, Lingxiao Ma, Fan Yang, Lili Qiu, Mao Yang, Lidong Zhou:
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation. CoRR abs/2301.10936 (2023)
- [i10] Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. CoRR abs/2303.08308 (2023)
- [i9] Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. CoRR abs/2303.09730 (2023)
- [i8] Yang Liu, Shen Yan, Yuge Zhang, Kan Ren, Quanlu Zhang, Zebin Ren, Deng Cai, Mi Zhang:
AutoTaskFormer: Searching Vision Transformers for Multi-task Learning. CoRR abs/2304.08756 (2023)
- [i7] Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang:
Efficient Large Language Models: A Survey. CoRR abs/2312.03863 (2023)
- 2022
- [c17] Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang:
Privacy-preserving Online AutoML for Domain-Specific Face Detection. CVPR 2022: 4124-4134
- [c16] Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. ICCD 2022: 738-745
- [c15] Ningxin Zheng, Bin Lin, Quanlu Zhang, Lingxiao Ma, Yuqing Yang, Fan Yang, Yang Wang, Mao Yang, Lidong Zhou:
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute. OSDI 2022: 213-232
- [i6] Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang:
Privacy-preserving Online AutoML for Domain-Specific Face Detection. CoRR abs/2203.08399 (2022)
- [i5] Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. CoRR abs/2209.10778 (2022)
- 2021
- [i4] Yuge Zhang, Chenqian Yan, Quanlu Zhang, Li Lyna Zhang, Yaming Yang, Xiaotian Gao, Yuqing Yang:
AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing. CoRR abs/2108.03001 (2021)
- 2020
- [j2] Yuzhen Cao, Jiayong Mao, Hui Yu, Qinhao Zhang, Huiquan Wang, Quanlu Zhang, Lingfei Guo, Fei Gao:
A Novel Hybrid Active Contour Model for Intracranial Tuberculosis MRI Segmentation Applications. IEEE Access 8: 149569-149585 (2020)
- [c14] Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Quanlu Zhang, Yaming Yang, Yunhai Tong, Jing Bai:
LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression. COLING 2020: 3225-3234
- [c13] Yang Li, Zhenhua Han, Quanlu Zhang, Zhenhua Li, Haisheng Tan:
Automating Cloud Deployment for Deep Learning Inference of Real-time Online Services. INFOCOM 2020: 1668-1677
- [c12] Hanyu Zhao, Zhenhua Han, Zhi Yang, Quanlu Zhang, Fan Yang, Lidong Zhou, Mao Yang, Francis C. M. Lau, Yuqi Wang, Yifan Xiong, Bin Wang:
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees. OSDI 2020: 515-532
- [c11] Quanlu Zhang, Zhenhua Han, Fan Yang, Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou:
Retiarii: A Deep Learning Exploratory-Training Framework. OSDI 2020: 919-936
- [c10] Chieh-Jan Mike Liang, Hui Xue, Mao Yang, Lidong Zhou, Lifei Zhu, Zhao Lucis Li, Zibo Wang, Qi Chen, Quanlu Zhang, Chuanjie Liu, Wenjun Dai:
AutoSys: The Design and Operation of Learning-Augmented Systems. USENIX ATC 2020: 323-336
- [i3] Yuge Zhang, Zejun Lin, Junyang Jiang, Quanlu Zhang, Yujing Wang, Hui Xue, Chen Zhang, Yaming Yang:
Deeper Insights into Weight Sharing in Neural Architecture Search. CoRR abs/2001.01431 (2020)
- [i2] Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Yaming Yang, Quanlu Zhang, Yunhai Tong, Jing Bai:
LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression. CoRR abs/2004.04124 (2020)
- [i1] Yuge Zhang, Quanlu Zhang, Yaming Yang:
How Does Supernet Help in Neural Architecture Search? CoRR abs/2010.08219 (2020)
2010 – 2019
- 2018
- [c9] Hanyu Zhao, Quanlu Zhang, Zhi Yang, Ming Wu, Yafei Dai:
SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines. SoCC 2018: 68-81
- [c8] Wencong Xiao, Zhenhua Han, Hanyu Zhao, Xuan Peng, Quanlu Zhang, Fan Yang, Lidong Zhou:
Scheduling CPU for GPU-based Deep Learning Jobs. SoCC 2018: 503
- [c7] He Xiao, Zhenhua Li, Ennan Zhai, Tianyin Xu, Yang Li, Yunhao Liu, Quanlu Zhang, Yao Liu:
Towards Web-based Delta Synchronization for Cloud Storage Services. FAST 2018: 155-168
- [c6] Shenglong Li, Quanlu Zhang, Zhi Yang, Hanyu Zhao, Yafei Dai:
Building efficient and available distributed transaction with Paxos-based coding consensus. INFOCOM Workshops 2018: 373-378
- [c5] Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, Lidong Zhou:
Gandiva: Introspective Cluster Scheduling for Deep Learning. OSDI 2018: 595-610
- 2017
- [c4] Quanlu Zhang, Zhenhua Li, Zhi Yang, Shenglong Li, Shouyang Li, Yangze Guo, Yafei Dai:
DeltaCFS: Boosting Delta Sync for Cloud Storage Services by Learning from NFS. ICDCS 2017: 264-275
- 2015
- [j1] Quanlu Zhang, Shenglong Li, Zhenhua Li, Yuanjian Xing, Zhi Yang, Yafei Dai:
CHARM: A Cost-Efficient Multi-Cloud Data Hosting Scheme with High Availability. IEEE Trans. Cloud Comput. 3(3): 372-386 (2015)
- [c3] Quanlu Zhang, Yafei Dai, Lintao Zhang:
DSwitch: a dual mode direct and network attached disk. SoCC 2015: 71-83
- [c2] Shenglong Li, Quanlu Zhang, Zhi Yang, Yafei Dai:
Understanding and Surpassing Dropbox: Efficient Incremental Synchronization in Cloud Storage Services. GLOBECOM 2015: 1-7
- [c1] Quanlu Zhang, Yafei Dai, Fengqian Li, Lintao Zhang:
UStore: A Low Cost Cold and Archival Data Storage System for Data Centers. ICDCS 2015: 431-441
last updated on 2025-03-04 21:25 CET by the dblp team
all metadata released as open data under CC0 1.0 license