- PARSEC benchmark
http://parsec.cs.princeton.edu
- Hybrid benchmarks (with MPI, CUDA, ...)
- CORAL
https://asc.llnl.gov/CORAL-benchmarks
- Multi-zone NAS Parallel Benchmarks
https://www.nas.nasa.gov/assets/pdf/techreports/2003/nas-03-010.pdf
- Start to look at MPI-3
- Traditional multi-core, weak memory: non-blocking data structures
- Disjoint memory spaces (CPU, GPU): CUDA
- Lock-free data structure benchmarks
http://www.cse.iitk.ac.in/users/mainakc/lockfree.html
http://htor.inf.ethz.ch/publications//img/hoefler-dsde-protocols.pdf
Look at Algorithm 2 (Nonblocking consensus), which uses MPI-3 non-blocking collectives
- Hybrid Examples from courses:
UW course:
CSS 534: Parallel Programming in Grid and Cloud - Programming Tasks
HW 2 is a good example:
courses.washington.edu/css534/prog/prog2.pdf
A Georgia Tech course:
CSE 6230: HPC Tools and Apps
and the relevant assignment:
stumptown.cc.gt.atl.ga.us:8080/cse6230-hpcta-fa09/hw3.pdf
A course at Cornell: http://www.cac.cornell.edu/education/Training/Intro/Hybrid-090529.pdf
Another course:
ITCS 4145 Cluster Computing
and assignment 4:
coitweb.uncc.edu/~abw/ITCS4145S13/Assignments/assign4S13.pdf
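
For the lock-free data structure item above, a minimal sketch of the kind of kernel such a benchmark exercises may help. It is not taken from any of the linked benchmarks: `LockFreeStack` and `count_concurrent_pushes` are hypothetical names, and the stack is a simplified Treiber-style design that only supports `push` and a whole-list `drain`, which sidesteps the memory-reclamation problems of a concurrent per-element pop.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical, simplified Treiber-style stack (illustration only).
class LockFreeStack {
    struct Node {
        int value;
        Node* next;
    };
    std::atomic<Node*> head_{nullptr};

public:
    LockFreeStack() = default;
    LockFreeStack(const LockFreeStack&) = delete;
    LockFreeStack& operator=(const LockFreeStack&) = delete;
    ~LockFreeStack() { drain(); }

    // Lock-free push: retry the compare-and-swap until the new node is linked.
    void push(int value) {
        Node* node = new Node{value, head_.load(std::memory_order_relaxed)};
        // On failure, compare_exchange_weak stores the current head in node->next.
        while (!head_.compare_exchange_weak(node->next, node,
                                            std::memory_order_release,
                                            std::memory_order_relaxed)) {
        }
    }

    // Detach the whole list with one atomic exchange, then free it.
    // Values are returned in LIFO order (most recent push first).
    std::vector<int> drain() {
        Node* node = head_.exchange(nullptr, std::memory_order_acquire);
        std::vector<int> values;
        while (node != nullptr) {
            values.push_back(node->value);
            Node* next = node->next;
            delete node;
            node = next;
        }
        return values;
    }
};

// Hypothetical driver: several threads push concurrently, then the
// stack is drained and the number of recovered elements is returned.
std::size_t count_concurrent_pushes(int thread_count, int pushes_per_thread) {
    assert(thread_count > 0 && pushes_per_thread >= 0);
    LockFreeStack stack;
    std::vector<std::thread> workers;
    for (int t = 0; t < thread_count; ++t) {
        workers.emplace_back([&stack, t, pushes_per_thread] {
            for (int i = 0; i < pushes_per_thread; ++i) {
                stack.push(t * pushes_per_thread + i);
            }
        });
    }
    for (auto& worker : workers) {
        worker.join();
    }
    return stack.drain().size();
}
```

Calling `count_concurrent_pushes(4, 1000)` should recover all 4000 elements if no push is lost. A real benchmark would additionally time the loop and compare against a lock-based baseline.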
