Filters: Author is Mei Wen
2016
J. Langguth, Q. Lan, N. Gaur, X. Cai and M. Wen. "Enabling Tissue-Scale Cardiac Simulations Using Heterogeneous Computing on Tianhe-2." In IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS), Edited by C. Zhang. ACM/IEEE, 2016.
2015
H. Su, X. Cai, M. Wen and C. Zhang. "An Analytical GPU Performance Model for 3D Stencil Computations from the Angle of Data Traffic." The Journal of Supercomputing 71, no. 7 (2015): 2433-2453.
X. Dong, M. Wen, J. Chai, X. Cai, M. Zhao and C. Zhang. "Communication-Hiding Programming for Clusters with Multi-Coprocessor Nodes." Concurrency and Computation: Practice and Experience 27, no. 16 (2015): 4172-4185.
D. Huang, C. Xun, N. Wu, M. Wen, C. Zhang, X. Cai and Q. Yang. "Enabling a Uniform OpenCL Device View for Heterogeneous Platforms." IEICE Transactions on Information and Systems E98-D, no. 4 (2015): 812-823.
J. Chai, J. E. Hake, N. Wu, M. Wen, X. Cai, G. T. Lines, J. Yang, H. Su, C. Zhang and X. Liao. "Towards Simulation of Subcellular Calcium Dynamics at Nanometre Resolution." International Journal of High Performance Computing Applications 29, no. 1 (2015): 51-63.
2014
D. Huang, M. Wen, C. Xun, D. Chen, X. Cai, Y. Qiao, N. Wu and C. Zhang. "Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs." In Proceedings of Euro-Par 2014, Edited by F. Silva. Vol. 8632. LNCS 8632. Berlin Heidelberg New York: Springer, 2014.
M. Wen, H. Su, W. Wei, N. Wu, X. Cai and C. Zhang. "High Efficient Sedimentary Basin Simulations on Hybrid CPU-GPU Clusters." Cluster Computing 17 (2014): 359-369.
X. Dong, J. Chai, J. Yang, M. Wen, N. Wu, X. Cai, C. Zhang and Z. Chen. "Utilizing Multiple Xeon Phi Coprocessors on One Compute Node." In International Conference on Algorithms and Architectures for Parallel Processing, Edited by X. Sun. Vol. 8631. LNCS 8631. Berlin Heidelberg New York: Springer, 2014.
2013
W. Wei, S. Clark, H. Su, M. Wen and X. Cai. "Balancing Efficiency and Accuracy for Sediment Transport Simulations." Computational Science & Discovery 6 (2013): 015011.
H. Su, N. Wu, M. Wen, C. Zhang and X. Cai. "On the GPU Performance of 3D Stencil Computations Implemented in OpenCL." In Proceedings of International Supercomputing Conference, ISC 2013, Edited by J. M. Kunkel, T. Ludwig and H. W. Meuer. Vol. 7905. Lecture Notes in Computer Science 7905. Berlin Heidelberg New York: Springer, 2013.
H. Su, N. Wu, M. Wen, C. Zhang and X. Cai. "On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations." In Proceedings of IEEE 19th International Conference on Parallel and Distributed Systems. Los Alamitos, California • Washington • Tokyo: IEEE, 2013.
H. Su, N. Wu, M. Wen, C. Zhang and X. Cai. "Performance of Sediment Transport Simulations on NVIDIA's Kepler Architecture." In The International Conference on Computational Science, ICCS 2013, Edited by V. Alexandrov, M. Lees, V. Krzhizhanovskaya, J. Dongarra and P. M. A. Sloot. Vol. 18. Procedia Computer Science 18. Elsevier, 2013.
J. Chai, H. Su, M. Wen, X. Cai, N. Wu and C. Zhang. "Resource-Efficient Utilization of CPU/GPU-Based Heterogeneous Supercomputers for Bayesian Phylogenetic Inference." The Journal of Supercomputing 66 (2013): 364-380.
J. Chai, M. Wen, N. Wu, D. Huang, J. Yang, X. Cai, C. Zhang and Q. Yang. "Simulating Cardiac Electrophysiology in the Era of GPU-Cluster Computing." IEICE Transactions on Information and Systems E96-D (2013): 2587-2595.