Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
mirrors
ROCmSoftwarePlatform
apex
Repository
Branches
Overview
Active
Stale
All
dev/hubertlu/fused_adam_cuda
a6e45fb1
·
Revert "Add distributed_fused_adam_v2 and _v3 back"
·
Sep 10, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/faster_build
075c3a9a
·
Remove redundant CUDAExtension import's
·
Sep 20, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
luise/fused_lars
d6e99c4a
·
Update primitive fused_lars optimizer, working for resnet50 with NHWC/NCHW
·
Sep 20, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/index_mul_2d_fix
8bafce60
·
Typo
·
Sep 22, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/unskip_82
bb1ef504
·
Fix TestFusedAdam tests in test_fused_optimizer.py
·
Nov 16, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
fused_sgd_fix
8465d2bd
·
Add an unit test script for nhwc fused_sgd
·
Nov 16, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/fused_dense_debug
merged
d63b5d1f
·
Add fused_dense in the extension unit test script
·
Dec 10, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/benchmark
ddf4689e
·
Add a benchmark script for Torch LayerNorm and Apex FusedLayerNorm
·
Dec 16, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
c17_update
fd42aaac
·
Updates to the code
·
Dec 21, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/run_transformer
72f978c2
·
Add some test folders for those ROCm does not support
·
Dec 30, 2022
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
dev/hubertlu/gbn_full_support
274033b1
·
Attempt to add unit test script for BatchNorm2d_NHWC with bn_group > 1
·
Jan 17, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
luise/gbn_optimization
b30d5f5b
·
GroupBN: Use C_ELEMENTS_PER_CTA=64 for BN_add_relu kernels for ~10% E2E improvement of resnet50
·
Feb 07, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
grid_optimization
ae80713b
·
Updating all files related to L2norm since test_fuzz...
·
Feb 16, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
six_torch_tensor_fix
d96ab1ce
·
use `torch.tensor` to create a tensor with initializer values (#1588)
·
Feb 28, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
luise/fused_lars_checkin
b9e2f708
·
Add flow of using nesterov in FusedLARS
·
Mar 21, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
rccl_update
bfa37470
·
Update rccl header include path
·
Mar 30, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
kk/fused-dense-hipblaslt
dcb7db12
·
enable gemm_bias_lt in fused_dense
·
Apr 12, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
fastlayernorm
fb79a52f
·
Merge branch 'master' into fastlayernorm
·
May 04, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
add_toml_file
b3c7da55
·
Adding pyproject.toml file
·
Jun 20, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
hipblas_support
8345dcf0
·
Changes to support hipblas migration
·
Aug 11, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
Prev
1
…
5
6
7
8
9
10
Next