Overlapped tensor cores #9

Open
Trinity-142 wants to merge 1 commit into n00bmasters:master from Trinity-142:tensor-cores
Conversation

@Trinity-142
Collaborator

3072x3072x3072
Warp Matrix Multiply-Accumulate GPU multiplication duration: ~3.53244 ms
TFLOPS: 16.41

3071x3071x3071
Warp Matrix Multiply-Accumulate GPU multiplication duration: ~5.87885 ms
TFLOPS: 9.86
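
For anyone checking the numbers, here is a rough sketch of how the TFLOPS figures relate to the timings. It assumes the usual 2·M·N·K operation count for a matrix multiply (one multiply and one add per inner-product term); the PR does not say how its figure is computed, and the helper name `tflops` is made up for this illustration.

```python
def tflops(m: int, n: int, k: int, duration_ms: float) -> float:
    """Throughput in TFLOPS for an (m x k) by (k x n) matrix multiply.

    Assumes 2*m*n*k floating-point operations, which is the common
    convention but not confirmed by the PR.
    """
    flops = 2 * m * n * k              # multiply + add per inner-product term
    seconds = duration_ms * 1e-3       # reported durations are in milliseconds
    return flops / seconds / 1e12      # scale to tera-FLOPS


if __name__ == "__main__":
    # Sizes and timings are the ones reported in this comment.
    for size, ms in [(3072, 3.53244), (3071, 5.87885)]:
        print(f"{size}x{size}x{size}: {tflops(size, size, size, ms):.2f} TFLOPS")
```

Under that assumption the 3072 case reproduces the reported 16.41, and the 3071 case lands within about 0.01 of the reported 9.86. The durations are approximate, so a small gap like that could come from rounding or from a slightly different operation count.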
