Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
mirrors
ROCmSoftwarePlatform
apex
Repository
Branches
Overview
Active
Stale
All
dev/hubertlu/rocblas_backward_compatible
f884c100
·
Fix some bugs
·
Apr 06, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
spumma/fused-larc
58d89574
·
Update LARCFunctor's metadata argument type
·
Apr 05, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mha_rocblas_alt_impl
e6df1c08
·
Fix for rocblas_alt_impl flag in MHA
·
Mar 24, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mlp_rocblas_alt_impl
c6aad5dd
·
Refactor rocblas_alt_impl implementation and only use it for backprop
·
Mar 23, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/athitten/mha_roblas_alt_impl_flag
2be68b18
·
Use ifdef for rocblas_gemm_flags_fp16_alt_impl to target at various AMD hardware
·
Mar 17, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/gbn_optim_new
245be05d
·
Re-enable gbn module in Apex for ROCm
·
Mar 01, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/gbn_optim
8aeffee3
·
Revert rectified, stg, and stg_stream
·
Feb 26, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/athitten/denorm_fix_bwd_gemms
046a4291
·
Add denorm fix for multihead_attn bwd rocblas calls
·
Jan 29, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
cherry_pick_b2fdf9c
e3f2ecb7
·
Cherry-pick
b2fdf9c4
from upstream Apex and resolve conflicts
·
Jan 27, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
pru_319886
a2ed1b0c
·
Fix for compilation failure
·
Jan 26, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
fused_layer_norm_optim
fea95106
·
Optimize cuComputePartGradGammaBeta for AMD GPUs
·
Jan 08, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
gbn-pytorch-channels-last
4c29c199
·
Add support for memory_format=torch.channels_last in GBN
·
Dec 15, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
gbn-channels-last
0435d4c4
·
Fix the memory-format API
·
Dec 15, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
IFU-master-2021-12-08
merged
68364b49
·
Conditionally define autocast_dtypes for different torch versions
·
Dec 15, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/skip_unittest
5fbb3515
·
Modify the test skipping messages
·
Dec 14, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
IFU-master-2021-10-15
merged
fec3141c
·
Replace THCudaCheck with C10_CUDA_CHECK
·
Dec 07, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
enable_all_supported_extensions
b4629e2e
·
Need to clear all cmdline args so setup.py doesn't complain
·
Dec 03, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/unit_tests
merged
2228f1bf
·
Merge remote-tracking branch 'origin/master' into dev/hubertlu/unit_tests
·
Dec 03, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/athitten/distributed_lamb
merged
f3868524
·
Enable Distributed FusedLAMB
·
Nov 19, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
IFU-master-2021-11-09
309363f5
·
Replace THCublasCheck with TORCH_CUDABLAS_CHECK
·
Nov 10, 2021
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
Prev
1
2
3
4
5
6
7
…
10
Next