Imperial College London > Talks@ee.imperial > CAS Talks > f-CNNx: Deploying Multiple CNNs in Complex AI Systems
Log inImperial users Other users No account?Information onFinding a talk Adding a talk Syndicating talks Who we are Everything else |
f-CNNx: Deploying Multiple CNNs in Complex AI SystemsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact George A Constantinides. In the construction of complex AI systems, deep neural network models are used as building blocks of a larger application. Nevertheless, deploying multiple models on a target platform poses a number of challenges. With each model trained for a different subtask of the system, the workload size and performance constraints vary accordingly. Moreover, the different models compete for the same pool of resources and hence resource allocation between models becomes a critical factor. In this talk, we will present f-CNNx, a toolflow whose goal is to automate the mapping of multiple convolutional neural networks (CNNs) on a target FPGA platform while meeting the require performance for each model. To generate an optimised multi-CNN design, f-CNNx introduces a highly-parametrised multi-CNN hardware architecture and a tunable memory access policy in order to explore a wide range of resource and bandwidth allocations. Furthermore, the toolflow incorporates the application-level importance of each model by means of multiobjective cost function in order to generate a hardware design that meets the target application’s requirements. Overall, f-CNNx overcomes the limitations of competing platforms by achieving up to 6.8x gains in performance-per-Watt over highly optimised embedded GPU designs in multi-CNN settings. This talk is part of the CAS Talks series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsAI- and HCI-related talks Andrea Picciau's list Type the title of a new list hereOther talksHybrid Renewable Energy Standalone Systems |