| 1 | +<!DOCTYPE html> |
| 2 | +<html lang="en"> |
| 3 | +<head> |
| 4 | + <meta charset="UTF-8"> |
| 5 | + <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| 6 | + <meta name="description" content="Public link catalog extracted from the CS6969 syllabus and side-tab materials."> |
| 7 | + <title>CS6969 Link Catalog</title> |
| 8 | + <link rel="stylesheet" href="css/style.css"> |
| 9 | +</head> |
| 10 | +<body> |
| 11 | + <header class="hero"> |
| 12 | + <div class="hero-inner"> |
| 13 | + <p class="eyebrow">CS6969 Resource Index</p> |
| 14 | + <h1>Full Link Catalog</h1> |
| 15 | + <p class="lede"> |
| 16 | + Public URLs gathered from the main syllabus document and its side tabs, |
| 17 | + organized by category so students can find software, papers, tutorials, |
| 18 | + slides, and project references quickly. |
| 19 | + </p> |
| 20 | + </div> |
| 21 | + </header> |
| 22 | + |
| 23 | + <nav class="topnav"> |
| 24 | + <a href="index.html">Home</a> |
| 25 | + <a href="#course">Course Docs</a> |
| 26 | + <a href="#slides">Slides</a> |
| 27 | + <a href="#software">Software</a> |
| 28 | + <a href="#papers">Papers</a> |
| 29 | + <a href="#aws">AWS / Trainium</a> |
| 30 | + <a href="#systems">CHPC / Tools</a> |
| 31 | + </nav> |
| 32 | + |
| 33 | + <main> |
| 34 | + <section class="panel alt"> |
| 35 | + <h2>About This Catalog</h2> |
| 36 | + <p> |
| 37 | + This page collects the public links embedded throughout the shared |
| 38 | + course planning material. It intentionally excludes a few non-public or |
| 39 | + unsafe entries, such as localhost-only URLs, one-time tokenized raw |
| 40 | + links, and machine-local pseudo-links that are not meaningful on a |
| 41 | + public course website. |
| 42 | + </p> |
| 43 | + </section> |
| 44 | + |
| 45 | + <section id="course" class="panel alt"> |
| 46 | + <h2>Course Documents and Administration</h2> |
| 47 | + <ul> |
| 48 | + <li><a href="https://docs.google.com/document/d/10IxljmaHBsQD4aQsfeQ6ge0RgAawlupg5fAK2zCCI6Y/edit">Main syllabus Google Doc</a></li> |
| 49 | + <li><a href="https://class-schedule.app.utah.edu/main/1264/class_list.html?subject=CS">University of Utah class schedule</a></li> |
| 50 | + <li><a href="https://registrar.utah.edu/academic-calendars/spring2026.php">Spring 2026 academic calendar</a></li> |
| 51 | + <li><a href="mailto:cs6969@googlegroups.com">Class Google Group email</a></li> |
| 52 | + <li><a href="http://cs.utah.edu">School of Computing</a></li> |
| 53 | + <li><a href="https://dblp.org/pid/192/0371.html">Professor Sreepathi Pai on DBLP</a></li> |
| 54 | + <li><a href="https://soarlab.org/team/baranowski/">Mark Baranowski profile</a></li> |
| 55 | + </ul> |
| 56 | + </section> |
| 57 | + |
| 58 | + <section id="slides" class="panel alt"> |
| 59 | + <h2>Slides, Notes, and Shared Class Documents</h2> |
| 60 | + <ul> |
| 61 | + <li><a href="https://docs.google.com/document/d/1hh-4Mn0vLGb3wZqg_fAu4COCmP2Hez7fUV8VY0CQpGs/edit?usp=sharing">Course organization notes</a></li> |
| 62 | + <li><a href="https://docs.google.com/presentation/d/1-KQhZabAgIDfrjPUgkTy4F6k2BqGpFiPdeS4oC_BKsw/edit?usp=sharing">Ganesh introductory slides</a></li> |
| 63 | + <li><a href="https://cs.rochester.edu/~sree/courses/cs6969-spring-2026/sree-intro.pdf">Sree intro slides</a></li> |
| 64 | + <li><a href="https://docs.google.com/presentation/d/1wLwiYiBSj3b4UrlhirfCdA2O1dPMqRIJh1LMdygTYuo/edit?usp=sharing">Number systems and tools slides</a></li> |
| 65 | + <li><a href="https://cs.rochester.edu/~sree/courses/cs6969-spring-2026/sree-perf-model.pdf">Performance model slides</a></li> |
| 66 | + <li><a href="https://cs.rochester.edu/~sree/courses/cs6969-spring-2026/sree-gpu-performance.pdf">GPU performance slides</a></li> |
| 67 | + <li><a href="https://docs.google.com/presentation/d/1mijmFYeSxJ5pfZKFdObCJYOC4lZWRzosEBBJF5XHgtM/edit?usp=sharing">GPU execution and schedule slides</a></li> |
| 68 | + <li><a href="https://docs.google.com/presentation/d/1C6vt0McbS2if2_AJoVdhWAzqrTK6owV8I71zlvUIOcE/edit?usp=sharing">AWS training slides</a></li> |
| 69 | + <li><a href="https://docs.google.com/presentation/d/1OVkwX4hO3V6tqrvNE9VwxiHOSabSzhxauVZp6ombeqU/edit?usp=sharing">Neuron architecture slides</a></li> |
| 70 | + <li><a href="https://docs.google.com/presentation/d/1usOJAgZCNI3hfuqcj6FZGrzEPrbEfTliG0mRRDcEEfs/edit?usp=sharing">Project and discussion slides</a></li> |
| 71 | + <li><a href="https://docs.google.com/presentation/d/1T9MSmJenR4EYb1L_14bUoOJWw-zd2CNq/edit?usp=sharing&amp;ouid=111495655245157297413&amp;rtpof=true&amp;sd=true">Student or guest presentation deck</a></li> |
| 72 | + <li><a href="https://docs.google.com/presentation/d/1bYp46_MeOXH2d4MLdi26QyK8J3T8uYkIVsy7DJmx_Aw/edit?usp=sharing">MLIR / MLIR-AIR slides</a></li> |
| 73 | + <li><a href="https://docs.google.com/presentation/d/13Wm5mQldICWj227BsHivin1ZhueXy4UJKfQorlbQ4KY/edit?usp=sharing">Faial / GKLEE slides</a></li> |
| 74 | + <li><a href="https://docs.google.com/presentation/d/1HtwRKhrrR_KWUkXmLafSuZEibsBGaiweGKG5agN1MQQ/edit?slide=id.g39d208624b6_0_943#slide=id.g39d208624b6_0_943">NVBit and barrier instrumentation slides</a></li> |
| 75 | + <li><a href="https://docs.google.com/presentation/d/1pFQFBpJLGdtykpkOqhHX3gMY3YT7BSe107N7Np8Nkkw/edit?usp=sharing">SLEEK / later-semester slides</a></li> |
| 76 | + <li><a href="https://docs.google.com/presentation/d/19PnhI6rXAj9u4xkFHxK2075Zi_yGHFiRlnmneCUE7ow/edit?usp=sharing">Transform dialect / late-semester slides</a></li> |
| 77 | + <li><a href="https://drive.google.com/file/d/1s_U_C0-a3IPqwqosmPp-EyVOo0mR033I/view?usp=drive_link">Hoare logic supplemental file</a></li> |
| 78 | + <li><a href="https://drive.google.com/file/d/1GEJSDhAD597nMmb1t7WIlmR71VmgPez3/view?usp=sharing">AWS or profiling supplemental video/file</a></li> |
| 79 | + <li><a href="https://drive.google.com/file/d/1GEJSDhAD597nMmb1t7WIlmR71VmgPez3/view?usp=drive_link">AWS or profiling supplemental file mirror</a></li> |
| 80 | + <li><a href="https://drive.google.com/file/d/1jVM5Bu1RYEB2E9DlO_JojVinG7bPADWL/view?usp=drive_link">Additional AWS training video/file</a></li> |
| 81 | + <li><a href="https://docs.google.com/document/d/1udXciT93VtBP8rbrGBt8AcQoacZ5dxR6QHEqGaqBgfw/edit?usp=sharing">Software pipelining notes</a></li> |
| 82 | + <li><a href="https://www.overleaf.com/read/pcfjvhpzghpt#9ec326">Assignment Overleaf workspace</a></li> |
| 83 | + <li><a href="https://www.overleaf.com/read/kfzrpddjsdpn#6b6082">Asg-3 Overleaf workspace</a></li> |
| 84 | + <li><a href="https://www.overleaf.com/read/wxrsxmdttcgw#da21ab">AWS/reporting Overleaf workspace</a></li> |
| 85 | + </ul> |
| 86 | + </section> |
| 87 | + |
| 88 | + <section id="software" class="panel alt"> |
| 89 | + <h2>Software, Repositories, and Tooling</h2> |
| 90 | + <ul> |
| 91 | + <li><a href="https://github.com/parfloat/parfloat-class">Course software repository</a></li> |
| 92 | + <li><a href="https://github.com/parfloat/parfloat-class/tree/main/TILUS">Tilus course checkout</a></li> |
| 93 | + <li><a href="https://github.com/parfloat/parfloat-class/tree/main/AIR2CUDA">AIR2CUDA course checkout</a></li> |
| 94 | + <li><a href="https://github.com/parfloat/parfloat-class/tree/main/MLIRXformTut">MLIR transform tutorial checkout</a></li> |
| 95 | + <li><a href="https://github.com/parfloat/parfloat-class/tree/gg_nixnan/barrier_nvbit/nvbit_release/tools/nixnan">Barrier NVBit tool checkout</a></li> |
| 96 | + <li><a href="https://github.com/parfloat/parfloat-class/blob/gg_nixnan/barrier_nvbit/docs/barrier_test_execution.pdf">Barrier NVBit execution notes</a></li> |
| 97 | + <li><a href="https://github.com/parfloat/parfloat-class/tree/gg_nixnan/nixnan/test/lu_solver/demo2">Nixnan LU demo</a></li> |
| 98 | + <li><a href="https://github.com/parfloat/parfloat-class/blob/gg_nixnan/nixnan/test/histogram/Results-3-3-higher-condnum.md">Nixnan histogram results</a></li> |
| 99 | + <li><a href="https://github.com/apache/tvm-ffi?tab=readme-ov-file">TVM FFI repository</a></li> |
| 100 | + <li><a href="https://tvm.apache.org/ffi/">TVM FFI documentation</a></li> |
| 101 | + <li><a href="https://github.com/eunomia-bpf/nvbit-tutorial/tree/main/tools/instr_count#what-are-predicates">NVBit instrumentation tutorial</a></li> |
| 102 | + <li><a href="https://github.com/burtscher/SLEEK/">SLEEK repository</a></li> |
| 103 | + <li><a href="https://github.com/AMD-AGI/GEAK-agent">GEAK repository</a></li> |
| 104 | + <li><a href="https://github.com/NVIDIA/TileGym">TileGym repository</a></li> |
| 105 | + <li><a href="https://github.com/rasbt">Sebastian Raschka repositories</a></li> |
| 106 | + <li><a href="https://github.com/aws-neuron/nki-samples">AWS Neuron NKI samples</a></li> |
| 107 | + <li><a href="https://github.com/aws-neuron/nki-samples/blob/main/src/nki_samples/tutorials/attention_fwd_performance/attention_kernels.py">NKI attention kernels source</a></li> |
| 108 | + <li><a href="https://github.com/stanford-cs149/asst4-trainium2">Stanford CS149 Trainium-2 assignment</a></li> |
| 109 | + <li><a href="https://github.com/keram88/aws-101-for-tranium?tab=readme-ov-file#5-automating-login">AWS Trainium starter guide</a></li> |
| 110 | + <li><a href="https://colab.research.google.com/github/sdiehl/gpu-offload/blob/main/Minimal.ipynb#scrollTo=XMQS0HOhCwvj">Stephen Diehl GPU offload notebook</a></li> |
| 111 | + <li><a href="https://stanford-cs336.github.io/spring2025/">Stanford CS336 LLM material</a></li> |
| 112 | + </ul> |
| 113 | + </section> |
| 114 | + |
| 115 | + <section id="papers" class="panel alt"> |
| 116 | + <h2>Papers and Reading Links</h2> |
| 117 | + <ul> |
| 118 | + <li><a href="https://dl.acm.org/doi/10.1145/1882291.1882320">Scalable SMT-based verification of GPU kernel functions</a></li> |
| 119 | + <li><a href="https://ieeexplore.ieee.org/document/10289219">Facile: Fast, Accurate, and Interpretable Basic-Block Throughput Prediction</a></li> |
| 120 | + <li><a href="https://dl.acm.org/doi/10.1145/3524059.3532396">uiCA throughput prediction paper</a></li> |
| 121 | + <li><a href="https://cogumbreiro.github.io/assets/faial-popl26.pdf">Faial modular static cost analysis PDF</a></li> |
| 122 | + <li><a href="https://cogumbreiro.github.io/assets/faial-fmsd23.pdf">Memory Access Protocols / Faial-related paper PDF</a></li> |
| 123 | + <li><a href="https://arxiv.org/pdf/2504.12984">Tilus paper PDF</a></li> |
| 124 | + <li><a href="https://arxiv.org/abs/2509.21039">Mojo in HPC paper</a></li> |
| 125 | + <li><a href="https://graphics.pixar.com/library/RenderManXPU/">RenderMan XPU paper/page</a></li> |
| 126 | + <li><a href="https://arxiv.org/abs/2512.04226">tritonBLAS paper</a></li> |
| 127 | + <li><a href="https://arxiv.org/pdf/2511.13940">ParallelKittens paper</a></li> |
| 128 | + <li><a href="https://dl.acm.org/doi/10.1145/3001834">Hoare Logic of GPU Programs</a></li> |
| 129 | + <li><a href="https://arxiv.org/pdf/1903.07486">LLVM/IR or GPU optimization reading</a></li> |
| 130 | + <li><a href="https://openreview.net/pdf?id=0fJfVOSUra">OpenReview paper linked in syllabus</a></li> |
| 131 | + <li><a href="https://arxiv.org/abs/2511.08083">Recent paper linked in syllabus</a></li> |
| 132 | + <li><a href="https://arxiv.org/pdf/2002.11054">Additional accelerator/compiler reading</a></li> |
| 133 | + <li><a href="https://arxiv.org/abs/2510.14871">From Loop Nests to Silicon: Mapping AI Workloads onto AMD NPUs with MLIR-AIR</a></li> |
| 134 | + <li><a href="https://dl.acm.org/doi/epdf/10.1145/3318464.3380595">MLIR paper</a></li> |
| 135 | + <li><a href="https://dl.acm.org/doi/pdf/10.1145/2145816.2145844">GKLEE paper PDF</a></li> |
| 136 | + <li><a href="https://dl.acm.org/doi/10.1145/3352460.3358307">Another correctness/performance paper linked in syllabus</a></li> |
| 137 | + <li><a href="https://dl.acm.org/doi/10.1145/103162.103163">Goldberg: What Every Computer Scientist Should Know About Floating-Point Arithmetic</a></li> |
| 138 | + <li><a href="https://dl.acm.org/doi/pdf/10.1145/3736112.3736144">Recent linked paper PDF</a></li> |
| 139 | + <li><a href="https://userweb.cs.txstate.edu/~burtscher/papers/ipdps26.pdf">SLEEK IPDPS 2026 paper</a></li> |
| 140 | + <li><a href="https://llvm.org/devmtg/2025-10/slides/quick_talks/baskaran_slama.pdf">LLVM developer meeting slides</a></li> |
| 141 | + <li><a href="https://arxiv.org/pdf/2602.19762">Recent arXiv paper linked in syllabus</a></li> |
| 142 | + <li><a href="https://research.nvidia.com/publication/2026-03_hunting-cuda-bugs-scale-cufuzz">CuFuzz NVIDIA research page</a></li> |
| 143 | + <li><a href="https://dl.acm.org/doi/10.1145/3696443.3708922">Transform dialect or related compiler paper</a></li> |
| 144 | + <li><a href="https://arxiv.org/html/2504.12984v3">Tilus paper HTML view</a></li> |
| 145 | + <li><a href="https://arxiv.org/pdf/2511.12638">Equivalence Checking of GPU Kernels</a></li> |
| 146 | + <li><a href="https://arxiv.org/abs/2507.23194">GEAK paper</a></li> |
| 147 | + <li><a href="https://arxiv.org/abs/2511.12294">ProofWright paper</a></li> |
| 148 | + <li><a href="https://arxiv.org/abs/2502.10517">KernelBench paper</a></li> |
| 149 | + <li><a href="https://drops.dagstuhl.de/storage/00lipics/lipics-vol263-ecoop2023/LIPIcs.ECOOP.2023.13/LIPIcs.ECOOP.2023.13.pdf">Dynamic determinacy race detection paper PDF</a></li> |
| 150 | + <li><a href="https://www.arxiv.org/abs/2512.18134">Optimal software pipelining and warp specialization for tensor core GPUs</a></li> |
| 151 | + <li><a href="https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&amp;arnumber=476167">Aiken 1995 software pipelining paper</a></li> |
| 152 | + <li><a href="https://vsl.cis.udel.edu/trac/civl/wiki">CIVL model checker</a></li> |
| 153 | + <li><a href="https://doi.org/10.1145/2076450.2076465">You don't know jack about shared variables or memory models</a></li> |
| 154 | + <li><a href="https://doi.org/10.1145/1543135.1542490">FastTrack race detection</a></li> |
| 155 | + <li><a href="https://doi.org/10.1145/2398857.2384625">GPUVerify paper</a></li> |
| 156 | + <li><a href="https://vercors.ewi.utwente.nl/">VerCors tool</a></li> |
| 157 | + <li><a href="https://ris.utwente.nl/ws/portalfiles/portal/24315485/Saeed_Darabi.pdf">Saeed Darabi thesis</a></li> |
| 158 | + <li><a href="https://dl.acm.org/doi/10.5555/3433701.3433768">Scalable yet rigorous floating-point error analysis</a></li> |
| 159 | + <li><a href="https://colab.research.google.com/drive/1W_FoAnQgYsmiQFlLEN_Pc3PwXl2NOhKB?usp=sharing">SMT Colab notebook</a></li> |
| 160 | + <li><a href="http://fpanalysistools.org">FP analysis tools</a></li> |
| 161 | + <li><a href="https://herbie.uwplse.org/">Herbie floating-point rewriting</a></li> |
| 162 | + <li><a href="https://vowi.fsinf.at/images/8/84/TU_Wien-GPU_Architectures_and_Programming_VU_(Bartocci)_-_GPU_Computing_Gems_Emerald_Edition.pdf">GPU Computing Gems</a></li> |
| 163 | + <li><a href="https://arxiv.org/abs/2503.17893">Recent performance-model paper</a></li> |
| 164 | + <li><a href="https://dl.acm.org/doi/10.1145/3725798.3725801">Shared-memory atomic bottlenecks paper DOI</a></li> |
| 165 | + <li><a href="https://www.sciencedirect.com/science/article/pii/S0743731522002295?dgcid=SSRN_redirect_SD&amp;ssrnid=4059952">Analytical performance models for GPUs</a></li> |
| 166 | + </ul> |
| 167 | + </section> |
| 168 | + |
| 169 | + <section id="aws" class="panel alt"> |
| 170 | + <h2>AWS Trainium, Neuron, and Profiling Links</h2> |
| 171 | + <ul> |
| 172 | + <li><a href="https://awsdocs-neuron.readthedocs-hosted.com/en/latest/tools/neuron-explorer/how-to-profile-workload.html">Neuron Explorer profiling guide</a></li> |
| 173 | + <li><a href="https://awsdocs-neuron.readthedocs-hosted.com/en/latest/tools/neuron-explorer/get-started.html#launch-the-api-and-ui-spa-servers">Neuron Explorer get-started guide</a></li> |
| 174 | + <li><a href="https://awsdocs-neuron.readthedocs-hosted.com/en/latest/nki/deep-dives/use-neuron-profile.html">Profile a NKI kernel</a></li> |
| 175 | + <li><a href="https://awsdocs-neuron.readthedocs-hosted.com/en/latest/nki/api/nki.isa.html#id1">NKI ISA documentation</a></li> |
| 176 | + <li><a href="https://awsdocs-neuron.readthedocs-hosted.com/en/v2.26.1/nki/tutorials/matrix_multiplication.html">NKI matrix multiplication tutorial</a></li> |
| 177 | + <li><a href="https://github.com/aws-neuron/nki-samples/tree/main/src/nki_samples/tutorials/attention_fwd_performance">NKI attention forward performance tutorial</a></li> |
| 178 | + <li><a href="https://utah.zoom.us/rec/share/yG3LRDlyn7ujehS_wbYnP8mFf4nUjFsDMUrChGuZBhY0JLNF4bhZA1hgk-80jQtM.1ILNdU0UfNRiiiY4?startTime=1770060311000">AWS / Trainium recorded session</a></li> |
| 179 | + </ul> |
| 180 | + </section> |
| 181 | + |
| 182 | + <section id="systems" class="panel alt"> |
| 183 | + <h2>CHPC, Profiling, and System Access</h2> |
| 184 | + <ul> |
| 185 | + <li><a href="https://www.chpc.utah.edu/documentation/software/slurm-gpus.php">CHPC GPU usage guide</a></li> |
| 186 | + <li><a href="http://granite.chpc.utah.edu">Granite CHPC host</a></li> |
| 187 | + <li><a href="http://notchpeak.chpc.utah.edu">Notchpeak CHPC host</a></li> |
| 188 | + <li><a href="https://developer.nvidia.com/nsight-systems">Nsight Systems download page</a></li> |
| 189 | + </ul> |
| 190 | + </section> |
| 191 | + |
| 192 | + <section class="panel alt"> |
| 193 | + <h2>Video and Media Links</h2> |
| 194 | + <ul> |
| 195 | + <li><a href="https://youtu.be/Uno_XhtkT5E?si=N4m2nYUUykKaRLIH">Course-linked YouTube video</a></li> |
| 196 | + <li><a href="https://youtu.be/hkgWi0oN_L8?si=Cl_H2Se2AlYvKmqN">Additional linked YouTube video</a></li> |
| 197 | + <li><a href="https://graphics.pixar.com/library/RenderManXPU/">RenderMan XPU media/library page</a></li> |
| 198 | + </ul> |
| 199 | + </section> |
| 200 | + </main> |
| 201 | + |
| 202 | + <footer> |
| 203 | + <p><a href="index.html">Back to CS6969 homepage</a></p> |
| 204 | + </footer> |
| 205 | +</body> |
| 206 | +</html> |