Conversation
…where (with Andrea Valassi)
…ests on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…ts on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…(with DanieleM)
Remove the custom __ARM_NEON__ with two extra underscores
Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
Results on lxplus-arm:
for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1 \
| \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 2.425635e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
Internal loops fptype_sv = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=YES]
EvtsPerSec[MECalcOnly] (3a) = ( 3.261666e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
…ith DanieleM) Remove the custom __ARM_NEON__ with two extra underscores Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
…ple (with DanieleM)
Results on an Apple M1 (thanks Olivier!)
for avx in none sse4; do ./build.${avx}_m_inl0_hrd0/check_cpp.exe -p 1024 256 1
| \egrep '(EvtsPerSec\[MECalcOnly\]|MeanMatrixElemValue|fptype_sv)'; done
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[MECalcOnly] (3a) = ( 5.535410e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
Internal loops fptype_sv = VECTOR[2] ('sse4': ARM NEON, 128bit) [cxtype_ref=NO]
EvtsPerSec[MECalcOnly] (3a) = ( 8.327442e+05 ) sec^-1
MeanMatrixElemValue = ( 2.080788e+00 +- 6.803789e-03 ) GeV^0
…e (with DanieleM)
…ts on aarch64 (with DanieleM) This fixes a hang in the testMisc tests on aarch64 in sqrtNewtonRaphson (madgraph5#1064) (testMisc -> constexpr_tan -> constexpr_tan_quad -> constexpr_cos_quad -> constexpr_sqrt -> sqrtNewtonRaphson) It uses the same workaround previously adopted for avoiding testMisc hangs when running valgrind (madgraph5#906)
…ith DanieleM) Remove the custom __ARM_NEON__ with two extra underscores Use 'g++ -march=armv8.2-a+simd -E -dM - < /dev/null | grep ARM' to check
…e (with DanieleM)
…ODEGEN commits Revert "[fix-arm-support] gg_tt.mad: undefine__ARM_NEON for cppnone on arm/apple (with DanieleM)" This reverts commit d427fcf. Revert "[fix-arm-support] gg_tt.mad: use builtin __ARM_NEON for aarch64 simd (with DanieleM)" This reverts commit 82645d2. Revert "[fix-arm-support] gg_tt.mad: use higher tolerance for constexpr_tan tests on aarch64 (with DanieleM)" This reverts commit 625bca8. Revert "Regenerate processes" This reverts commit 1e1c899.
…ts (#2) on aarch64 (with DanieleM)
…_tan tests (#2) on aarch64 (with DanieleM)
…using constexpr_sqrt on aarch64 (with DanieleM)
…l tests using constexpr_sqrt on aarch64 (with Danie>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.