Fused-layer CNN accelerators M Alwani, H Chen, M Ferdman, P Milder 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 781 | 2016 |
Maximizing CNN accelerator efficiency through resource partitioning Y Shen, M Ferdman, P Milder ACM SIGARCH Computer Architecture News 45 (2), 535-547, 2017 | 410 | 2017 |
Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPGPUs? ES Chung, PA Milder, JC Hoe, K Mai Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on …, 2010 | 343 | 2010 |
Computer generation of hardware for linear digital signal processing transforms P Milder, F Franchetti, JC Hoe, M Püschel ACM Transactions on Design Automation of Electronic Systems (TODAES) 17 (2 …, 2012 | 159 | 2012 |
Escher: A CNN accelerator with flexible buffering to minimize off-chip transfer Y Shen, M Ferdman, P Milder 2017 IEEE 25Th annual international symposium on field-programmable custom …, 2017 | 128 | 2017 |
Generation of optical OFDM signals using 21.4 GS/s real time digital signal processing Y Benlachtar, PM Watts, R Bouziane, P Milder, D Rangaraj, A Cartolano, ... Optics Express 17 (20), 17658-17668, 2009 | 88 | 2009 |
Automatic generation of customized discrete Fourier transform IPs G Nordin, PA Milder, JC Hoe, M Püschel Proceedings of the 42nd annual Design Automation Conference, 471-474, 2005 | 88 | 2005 |
Efficient methods for natural language processing: A survey M Treviso, JU Lee, T Ji, B Aken, Q Cao, MR Ciosici, M Hassid, K Heafield, ... Transactions of the Association for Computational Linguistics 11, 826-860, 2023 | 86 | 2023 |
Formal datapath representation and manipulation for implementing DSP transforms PA Milder, F Franchetti, JC Hoe, M Püschel Proceedings of the 45th annual Design Automation Conference, 385-390, 2008 | 66 | 2008 |
Overcoming resource underutilization in spatial CNN accelerators Y Shen, M Ferdman, P Milder 2016 26th International Conference on field programmable logic and …, 2016 | 63 | 2016 |
Permuting streaming data using RAMs M Püschel, PA Milder, JC Hoe Journal of the ACM (JACM) 56 (2), 1-34, 2009 | 63 | 2009 |
Computer generation of streaming sorting networks M Zuluaga, P Milder, M Püschel Proceedings of the 49th Annual Design Automation Conference, 1245-1253, 2012 | 62 | 2012 |
Real-time OFDM or Nyquist pulse generation–which performs better with limited resources? R Schmogrow, R Bouziane, M Meyer, PA Milder, PC Schindler, RI Killey, ... Optics Express 20 (26), B543-B551, 2012 | 56 | 2012 |
Optical OFDM for the data center Y Benlachtar, R Bouziane, RI Killey, CR Berger, P Milder, R Koutsoyannis, ... 2010 12th International Conference on Transparent Optical Networks, 1-4, 2010 | 56 | 2010 |
" Smart" design space sampling to predict Pareto-optimal solutions M Zuluaga, A Krause, P Milder, M Püschel Proceedings of the 13th ACM SIGPLAN/SIGBED International Conference on …, 2012 | 52 | 2012 |
System, method, and accelerator to process convolutional neural network layers M Ferdman, P Milder, M Alwani US Patent 10,726,330, 2020 | 51 | 2020 |
Theoretical and experimental evaluation of clipping and quantization noise for optical OFDM CR Berger, Y Benlachtar, RI Killey, PA Milder Optics express 19 (18), 17713-17728, 2011 | 51 | 2011 |
Streaming sorting networks M Zuluaga, P Milder, M Püschel ACM Transactions on Design Automation of Electronic Systems (TODAES) 21 (4 …, 2016 | 43 | 2016 |
Generating FPGA-accelerated DFT libraries P D'Alberto, PA Milder, A Sandryhaila, F Franchetti, JC Hoe, JMF Moura, ... 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines …, 2007 | 42 | 2007 |
Fast and accurate resource estimation of automatically generated custom DFT IP cores PA Milder, M Ahmad, JC Hoe, M Püschel Proceedings of the 2006 ACM/SIGDA 14th international symposium on Field …, 2006 | 38 | 2006 |