HPC

Drive faster breakthroughs through faster code: Get more results on your hardware today and carry your code forward to the future with code modernization.

Performance Evaluation of FPGA-based Hardware Accelerator for HPC applications

URL: https://github.com/adarshmishra-ti/Performance-Evaluation-of-FPGA-based-Hardware-Accelerator-for-HPC-applications

Description:

In this project, we aim to determine whether FPGAs are a viable option as hardware accelerators and, if so, how their performance compares to existing processors such as GPGPUs across various types of HPC workloads. We have an opportunity to benefit from the recent developments in High-Level

A DPC++ Backend for the OCCA Portability Framework

URL: https://github.com/libocca/occa

Description:

OCCA, an open-source, portable, and vendor-neutral framework for parallel programming on heterogeneous platforms, is used by mission-critical computational science and engineering applications of public and private sector organizations, including the U.S. Department of Energy and Shell.

oneAPI_project_SpMV

URL: https://gitee.com/baibai1/one-api_project_-sp-mv/tree/master/

Description:

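This project was migrated from a CUDA implementation of sparse matrix-vector multiplication (SpMV): the dpct tool converted the source to DPC++ automatically, and a small amount of code was then modified by hand. It mainly implements the SpMV computation interface. The matrix and the vector are both randomly generated, and the matrix is stored and processed in the CSR compressed format. There has not yet been time to accelerate the code with local memory or to apply the CSRv algorithm optimization; both are left for later. The large amount of commented-out code in this project is also worth reading: much of the commented-out code in the kernel of the device_spmv function is an implementation of the CSRv algorithm, which better matches the GPU memory access pattern, and local memory can bring a considerable speedup. The commented-out code in main can be used for device warm-up and performance timing. The verify function compares the GPU results with the CPU results to check the correctness of the GPU code.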
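The CSR storage and the CPU-side result check described above can be illustrated with a minimal sketch. This is not the project's DPC++ code: it is a plain Python reference under stated assumptions, and the function names (`dense_to_csr`, `spmv_csr`, `spmv_dense`, `verify`, `random_sparse`), the tolerance, and the matrix density are all hypothetical choices made for illustration.

```python
import random


def dense_to_csr(dense):
    """Convert a dense matrix (list of rows) into CSR arrays.

    Returns (values, col_idx, row_ptr), where row i owns the entries
    values[row_ptr[i]:row_ptr[i + 1]].
    """
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, a in enumerate(row):
            if a != 0.0:
                values.append(a)
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def spmv_csr(values, col_idx, row_ptr, x):
    """Compute y = A x for A in CSR form, one output element per row."""
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y.append(acc)
    return y


def spmv_dense(dense, x):
    """Dense reference product, used only to check the CSR result."""
    return [sum(a * b for a, b in zip(row, x)) for row in dense]


def verify(y, y_ref, tol=1e-9):
    """Element-wise comparison with a relative tolerance (assumed value)."""
    if len(y) != len(y_ref):
        return False
    return all(abs(a - b) <= tol * max(1.0, abs(b)) for a, b in zip(y, y_ref))


def random_sparse(n_rows, n_cols, density, rng):
    """Random matrix in which each entry is nonzero with probability `density`."""
    return [
        [rng.uniform(-1.0, 1.0) if rng.random() < density else 0.0
         for _ in range(n_cols)]
        for _ in range(n_rows)
    ]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    A = random_sparse(64, 64, 0.05, rng)
    x = [rng.uniform(-1.0, 1.0) for _ in range(64)]
    y = spmv_csr(*dense_to_csr(A), x)
    print("results match:", verify(y, spmv_dense(A, x)))
```

The inner loop over `row_ptr[i]` to `row_ptr[i + 1]` is the part a device kernel would typically assign to one work-item per row; a CSR-vector style variant would instead share each row among several work-items, which is not shown here.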

iDVR Intelligent Digital Video Recording

URL: https://github.com/cabelo/idvr

Description:

The objective of the project is to obtain a stream of H.265 video, encoded either by external devices or by the server itself, with one big difference: the stream or video is indexed by convolutional neural network models. The system can then find specific scenes without evaluating all of the media content.

oneOLIGO

URL: https://github.com/Eug9/oneoligo.git

Description:

In the oneOLIGO project, we use oneAPI to implement scalable, heterogeneous parallel-processing algorithms that can quickly and accurately decode digital data stored in synthetic DNA generated by the OligoArchive project.

Bioinformatic-Algorithms

URL: https://github.com/artecs-group/nmf-dpcpp

Description:

The computing time required to process large data matrices may become impractical, even for a parallel application running on a multiprocessor cluster. NMF-DPC++ is an efficient and easy-to-use implementation of the NMF algorithm that achieves high computing performance through SYCL.
