Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenyq
iGEMMgen
Repository
Branches
Overview
Active
Stale
All
igemm_codegen_wrw
7635f098
·
update ignore file
·
Jul 20, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
wrw_reduction
51e12271
·
right code for 32x32x4 kernel
·
Jul 20, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
wrw_xdlops_with_reduction
e30f2c50
·
add an xdlops version for 128x128 block
·
Aug 26, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
gtc_wrw_2x2x2
491eca3d
·
remove debug code
·
Oct 14, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
fp32_multi_k
04247997
·
add some configs and script for multi k instruction
·
Oct 15, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mfma_fwd
bf3bf521
·
Merge branch 'master' into mfma_fwd
·
Oct 20, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
config-automatical-generation
b0442012
·
fix some format issue
·
Nov 06, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
master_fix_fwd_step_repeat_both_2x2
62a13e98
·
fix bug when fwd's step and repeat are both 2x2
·
Nov 30, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
wrw_b_padding_bug_fix
3fa80e50
·
fix bug in wrw when padding b
·
Dec 07, 2020
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
master
default
protected
468826f8
·
minor change the index to size_t (#76)
·
Jan 19, 2021
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mfma_fp16
cefd8b45
·
add oob feature
·
Jan 19, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
nhwc_fwd
88a3c36b
·
more add
·
Jan 21, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
wider_tensor_size
71c6256c
·
seperate block idx with thread idx into sgpr and vgpr, when compute n1b transform
·
Jan 21, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mfma_fp16_bwd
cd37ab80
·
Update to the use of pack_d0 in get_macro_shared_store
·
Jan 21, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
split_4G_pices_per_batch
2f64e3b4
·
fwd support split batch
·
Jan 22, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mfma_fp16_wrw
035c7818
·
fix bug for step1 repeat1's double buffer case
·
Jan 22, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar