Skip to content

[libc] Add AMDGPU Timing Utils #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1,094 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1094 commits
Select commit Hold shift + click to select a range
954b692
[flang] Allow derf as alternate spelling for erf (#95784)
DavidTruby Jun 25, 2024
8ad32ce
[X86] Add sub-feature zu (zero upper) for APX
KanRobert Jun 25, 2024
7ea63b9
[LoongArch][test] Remove the FIXME in psabi-restricted-scheduling.ll …
SixWeining Jun 25, 2024
b0e9b00
[NVPTX] Make nvptx mma instructions convergent. (#96521)
weiweichen Jun 25, 2024
4e0a0ea
[X86] Fix test Clang::CodeGen/builtin-cpu-supports.c failure
KanRobert Jun 25, 2024
9267f8f
[clang-format] Add option to remove leading blank lines (#91221)
sstwcw May 6, 2024
437366b
Adjust MSVC version range for ARM64 build performance regression (#90…
lxbndr Jun 25, 2024
c393121
[libc++] Remove Windows-specific configuration from libcxx/test/CMake…
ldionne Jun 25, 2024
8348d72
[clang-tidy] Fix assert in performance-unnecessary-copy-init. (#96506)
legrosbuffle Jun 25, 2024
bd488c1
[RISCV] Use StringRef::starts_with(char) instead of starts_with(Strin…
topperc Jun 25, 2024
8599629
[CodeGen][NewPM] Port machine post dominator tree analysis to new pas…
paperchalice Jun 25, 2024
62d44fb
[MC] Remove setUseAssemblerInfoForParsing(false) workarounds
MaskRay Jun 25, 2024
f1e0657
[mlir][Transforms] Dialect conversion: Simplify handling of dropped a…
matthias-springer Jun 25, 2024
41f8e6d
[RISCV][GISel] Fix test case order in fp-arith.mir. NFC
topperc Jun 25, 2024
4c91b49
[clangd] Fix the build broken (NFC)
DamonFool Jun 25, 2024
efa8463
[VectorCombine] Add free concats to shuffleToIdentity. (#94954)
davemgreen Jun 25, 2024
f019581
[SmallPtrSet] Add remove_if() method (#96468)
nikic Jun 25, 2024
fa20184
[C++20] [Modules] [Serialization] Don't reuse type ID and identifier …
ChuanqiXu9 Jun 25, 2024
8153773
[clang][Interp] Fix returning primitive non-blockpointers
tbaederr Jun 25, 2024
174f80c
[DomTree] Avoid duplicate hash lookups in runDFS() (NFCI) (#96460)
nikic Jun 25, 2024
01fb529
Fix bazel after 18cf1cd92b554ba0b870c6a2223ea4d0d3c6dd21
ftynse Jun 25, 2024
7f1a744
[TailDup][MachineSSAUpdater] Let RewriteUse insert a COPY when needed…
bjope Jun 25, 2024
11e12bd
[NFC] [GWP-ASan] Rename Check() to check() (#96605)
hctim Jun 25, 2024
d782119
[Analysis] Use SmallPtrSet::remove_if (NFC)
nikic Jun 25, 2024
2d84e0f
[RISCV] Add scheduling model for Syntacore SCR3 (#95427)
asi-sc Jun 25, 2024
8395f9c
[lldb/DWARF] Remove parsing recursion when searching for definition D…
labath Jun 25, 2024
f6aa508
[SPIR-V]: Fix creation of constants of array types in SPIRV Backend (…
VyacheslavLevytskyy Jun 25, 2024
0bd9c49
[AArch64][SVE] optimisation for SVE load intrinsics with no active la…
Lukacma Jun 25, 2024
919b1ec
[flang][debug] Support pointer type. (#96153)
abidh Jun 25, 2024
5feb32b
[AMDGPU] Extend readlane, writelane and readfirstlane intrinsic lower…
vikramRH Jun 25, 2024
f9795f3
[GlobalISel] Add build methods for FP environment intrinsics (#96607)
spavloff Jun 25, 2024
16d02cd
[libc++] Use __is_nothrow_destructible (#95766)
philnik777 Jun 25, 2024
29f4a05
[SetOperations] clang-format header (NFC)
nikic Jun 25, 2024
f09b024
[clang] Remove a stale FIXME
hokein Jun 25, 2024
9d45077
[VPlan] Iterate over VPlans to get VFs to compute cost for (NFCI).
fhahn Jun 25, 2024
eeb0884
[LoopUnroll] Use poison instead of undef for preheader value
nikic Jun 25, 2024
a2e9157
[LV] Make create-induction-resume.ll more robust by adding store.
fhahn Jun 25, 2024
37c736e
[LoopUnroll] Use poison instead of undef for another preheader value
nikic Jun 25, 2024
cc8fdd6
[Xtensa] Lower GlobalAddress/BlockAddress/JumpTable (#95256)
andreisfr Jun 25, 2024
4acc8ee
[SCCP] Generate test checks (NFC)
nikic Jun 25, 2024
16bb8c1
[SCCP] Use poison instead of undef when zapping returns
nikic Jun 25, 2024
35eef9f
[Reassociate] Use poison instead of undef for dummy operands (NFCI)
nikic Jun 25, 2024
65f9601
[NFC][lld][ELF] Remove unused `sec` param of `ObjFile<ELFT>::getReloc…
kovdan01 Jun 25, 2024
bfad875
[LoongArch] Ensure PseudoLA* can be hoisted (#94723)
heiher Jun 25, 2024
9952e00
[VPlanTest] Use poison instead of undef for dummy values (NFC)
nikic Jun 25, 2024
68efc50
[VectorBuilderTest] Use poison instead of undef for dummy values (NFC)
nikic Jun 25, 2024
9acb533
[clang][Driver] Add HIPAMD Driver support for AMDGCN flavoured SPIR-V…
AlexVlx Jun 25, 2024
a46a2c2
[X86] Lower vXi8 multiplies using PMADDUBSW on SSSE3+ targets (#95690)
RKSimon Jun 25, 2024
5ed5d72
[mlir][ArmSME] Lower multi-tile stores to a single loop (#96187)
MacDue Jun 25, 2024
6c9916d
[libc] Configure CMAKE_REQUIRED_FLAGS so the GPU can use flag checks …
jhuber6 Jun 25, 2024
0f111ba
LoopInfo: introduce Loop::getLocStr; unify debug output (#93051)
artagnon Jun 25, 2024
8263bec
[SLP] Use poison instead of undef in reorderScalars() (#96619)
nikic Jun 25, 2024
952bdaa
[flang][OpenMP] Fix copyprivate allocatable/pointer lowering (#95975)
luporl Jun 25, 2024
c436649
[AArch64] Remove all instances of the 'hasSVEorSME' interfaces. (#96543)
sdesmalen-arm Jun 25, 2024
c5aa983
[InstSimplify] Fold all poison phi to poison instead of undef
nikic Jun 25, 2024
4780dc3
[InstCombine] Add poison variant to phi test (NFC)
nikic Jun 25, 2024
abc8c4b
[LoopVectorize] Generate test checks (NFC)
nikic Jun 25, 2024
edbe698
[libc][math][c23] Add f16divf C23 math function (#96131)
overmighty Jun 25, 2024
eabe091
[bazel] Add missing dependency for 5ed5d723db58f7556782427444642d0571…
d0k Jun 25, 2024
70cfece
Revert "[clang-format] Add option to remove leading blank lines (#912…
sstwcw Jun 25, 2024
c69ea04
Revert "[clangd] Fix the build broken (NFC)"
DamonFool Jun 25, 2024
fd62906
[libc++] Fix incorrect overflow checking in std::lcm (#96310)
ldionne Jun 25, 2024
44c9a26
[Mem2Reg] Add tests with !noundef (NFC)
nikic Jun 25, 2024
c9f083a
[Clang][AMDGPU] Add builtins for instrinsic `llvm.amdgcn.raw.ptr.buff…
shiltian Jun 25, 2024
1462605
[Analysis] Use range-based for loops (NFC) (#96587)
kazutakahirata Jun 25, 2024
2dd4167
[LoopVectorize][AArch64] Add limited support for scalable vectorisati…
david-arm Jun 25, 2024
54b61ad
[NFC][clang] Replace unreachable code in literal processing with asse…
mikerice1969 Jun 25, 2024
eb12d9b
[libc++] Remove the allocator<const T> extension (#96319)
ldionne Jun 25, 2024
d6c7410
AMDGPU: Remove an outdated TODO (#96446)
nhaehnle Jun 25, 2024
7e9b49f
AMDGPU: Add plumbing for private segment size argument (#96445)
nhaehnle Jun 25, 2024
13ed349
[AMDGPU][NFC] Rename AMDGPUVariadicMCExpr to AMDGPUMCExpr. (#96618)
kosarev Jun 25, 2024
fb463e1
[C23] Remove WG14 N2379 from the C status page
AaronBallman Jun 25, 2024
c6973ad
[Mem2Reg] Generate non-terminator unreachable for !noundef undef (#96…
nikic Jun 25, 2024
de528ff
[Flang] Extracting internal constants from scalar literals (#73829)
Leporacanthicus Jun 25, 2024
2cf1975
[MLIR][OpenMP]Add order-modifier support to Order clause (#93805)
harishch4 Jun 25, 2024
a404529
[SPIRV] Add definitions for NonSemantic debug info (#95530)
bwlodarcz Jun 25, 2024
ec9ce89
[LoopVectorize] Fix build issue caused by #95920 (#96647)
david-arm Jun 25, 2024
2274c66
[libc++] Use _If for conditional_t (#96193)
philnik777 Jun 25, 2024
ed1273d
[libc++] change the visibility of libc++ header to public in libcxx m…
RichardLuo0 Jun 25, 2024
bb5ab1f
[𝘀𝗽𝗿] initial version
shawbyoung Jun 24, 2024
020f69c
Added opts::Lite to RewriteInstance
shawbyoung Jun 25, 2024
c097e64
Revert "Added opts::Lite to RewriteInstance"
shawbyoung Jun 25, 2024
0e11a7e
[EarlyCSE] Add test with noundef load of undef (NFC)
nikic Jun 25, 2024
79e8a59
[libc++] Move allocator assertion into allocator_traits (#94750)
huixie90 Jun 25, 2024
7f10ed6
[tsan] Fix dead lock when starting StackDepot thread (#96456)
vitalybuka Jun 25, 2024
dddef9d
[RISCV] Add FPR16 regbank and start legalizing f16 operations for Zfh…
topperc Jun 25, 2024
8a46bbb
[Clang] Remove preprocessor guards and global feature checks for NEON…
Lukacma Jun 25, 2024
8c11d37
[libc++] P3029R1: Better `mdspan`'s CTAD - `std::extents` (#89015)
xiaoyang-sde Jun 25, 2024
902952a
Revert "[𝘀𝗽𝗿] initial version"
shawbyoung Jun 25, 2024
731db06
[libc++] Get the GCC build mostly clean of warnings (#96604)
philnik777 Jun 25, 2024
2604830
Add support for __builtin_verbose_trap (#79230)
ahatanak Jun 25, 2024
bb075ee
[libc++] LWG3382: NTTP for `pair` and `array` (#85811)
JMazurkiewicz Jun 25, 2024
889f3c5
AMDGPU: Handle legal v2bf16 atomicrmw fadd for gfx12 (#95930)
arsenm Jun 25, 2024
b7768c5
[clang][Interp][NFC] Use delegate() to delegate to only initlist item
tbaederr Jun 25, 2024
e951bd0
Reapply PR/87550 (again) (#95571)
oontvoo Jun 25, 2024
dca49d7
[libc][arm32] define argc type and stack alignment (#96367)
nickdesaulniers Jun 25, 2024
fb07afe
[BPF] Avoid potential long compilation time without -g (#96575)
yonghong-song Jun 25, 2024
ac1e22f
[mlir][vector] Generalize folding of ext-contractionOp to other types…
raikonenfnu Jun 25, 2024
580343d
[clang][Interp][NFC] Destroy InitMap when moving contents to DeadBlock
tbaederr Jun 25, 2024
c9529f7
[mlir] Drop outermost dims in slice rank reduction inference (#95020)
Max191 Jun 25, 2024
05ca207
[C23] Update status page regarding FLT_MAX_EXP
AaronBallman Jun 25, 2024
d7dd778
[clang-doc] update install path to share/clang-doc instead of share/c…
PeterChou1 Jun 25, 2024
dbd5c78
[clang-doc] Remove stdexecpt from clang-doc test (#96552)
PeterChou1 Jun 25, 2024
f0f774e
[sanitizer] Rename DEFINE_REAL_PTHREAD_FUNCTIONS (#96527)
vitalybuka Jun 25, 2024
de7c139
[SLP] NFC. Refactor and add getAltInstrMask help function. (#94709)
HanKuanChen Jun 25, 2024
aaf50bf
[AMDGPU] Disallow negative s_load offsets in isLegalAddressingMode (#…
jayfoad Jun 25, 2024
5e2beed
[C23] Move WG14 N2931 to the TS18661 section
AaronBallman Jun 25, 2024
0b049ce
[tsan] Test `__tsan_test_only_on_fork` only on Mac (#96597)
vitalybuka Jun 25, 2024
4c87212
[libc][thumb] support syscalls from thumb mode (#96558)
nickdesaulniers Jun 25, 2024
cd2bac8
[nfc][tsan] Better name for locking functions (#96598)
vitalybuka Jun 25, 2024
0258a60
[nfc][tsan] Clang format includes (#96599)
vitalybuka Jun 25, 2024
5c9513a
[NVPTX] cap param alignment at 128 (max supported by ptx) (#96117)
AlexMaclean Jun 25, 2024
70fb1e3
Reland [mlir][Target] Improve ROCDL gpu serialization API (#96198)
fabianmcg Jun 25, 2024
c0dc134
[tsan] Lock/Unlock allocator and stacks on fork (#96600)
vitalybuka Jun 25, 2024
39a0aa5
[SelectionDAG] Lower llvm.ldexp.f32 to ldexp() on Windows. (#95301)
efriedma-quic Jun 25, 2024
928b780
[WebAssembly] Implement trunc_sat and convert instructions for f16x8.…
brendandahl Jun 25, 2024
a790279
[flang] Add basic -mtune support (#95043)
AlexisPerry Jun 25, 2024
a086933
[C11] Remove WG14 N1537 from the status page
AaronBallman Jun 25, 2024
21ab32e
[lldb][LibCxx] Move incorrect nullptr check (#96635)
Michael137 Jun 25, 2024
b9353f7
[LinkerWrapper][NFC] Simplify StringErrors (#96650)
jhuber6 Jun 25, 2024
fef144c
Revert "[llvm] Use llvm::sort (NFC) (#96434)"
kazutakahirata Jun 25, 2024
aec735c
[Flang][OpenMP][MLIR] Fix common block mapping for regular and declar…
agozillon Jun 25, 2024
b003c60
[mlir][arith] Match folding of `arith.remf` to `llvm.frem` semantics …
ubfx Jun 25, 2024
a255ece
XFAIL llvm/test/DebugInfo/attr-btf_type_tag.ll on AIX (#96677)
nivethakuruparan Jun 25, 2024
ec2fb59
[BOLT][docs] Add Linux kernel optimization guide (#96669)
maksfb Jun 25, 2024
8dd9494
Revert "[flang] Add basic -mtune support" (#96678)
tarunprabhu Jun 25, 2024
7e77353
[clang][test] Avoid writing to a potentially write-protected dir (#96…
karka228 Jun 25, 2024
a55dc1d
[SLP][NFC]Add a test with the incorrect casting of the sext/zext alte…
alexey-bataev Jun 25, 2024
600ff28
[mlgo] add 2 new features whether caller/callee is `available_externa…
mtrofin Jun 25, 2024
bd5b775
__asan_register_elf_globals: properly check the "no instrumented glob…
MaskRay Jun 25, 2024
80114a7
[ScheduleDAG] Remove obsolete VReg2SUnitMap. NFC (#96559)
bjope Jun 25, 2024
699cd9a
[llvm-dwp] Fix merging of debug_str_offsets with multiple contributio…
molar Jun 25, 2024
228c2e1
[SLP]Fix incorrect promotion of nodes before shuffling.
alexey-bataev Jun 25, 2024
3c24eb3
[LLVM][MIR] Support parsing bfloat immediates in MIR parser (#96010)
stephenchouca Jun 25, 2024
ef05b03
[libc][math][c23] Add MPFR exhaustive test for fmodf16 (#94656)
overmighty Jun 25, 2024
88f80ae
[libc][math] Implement double precision cos correctly rounded to all …
lntue Jun 25, 2024
04c2785
[MC,COFF] Change how we handle section symbols
MaskRay Jun 25, 2024
d264514
Enforce parameter order in f16 call, flipped call unit tests (#96258)
akshayrdeodhar Jun 25, 2024
0280f97
[SLP]Fix PR95925: extract vectorized index of the potential buildvect…
alexey-bataev Jun 25, 2024
dff6871
[RISCV][GISel] Support G_FCONSTANT for Zfh.
topperc Jun 25, 2024
7e59b20
[SCEV] Support addrec in right hand side in howManyLessThans (#92560)
mrdaybird Jun 25, 2024
edf5782
[TableGen] Check for duplicate register tuple definitions. (#95725)
nvjle Jun 25, 2024
8bba070
AMDGPU: Expand testing of atomicrmw fmin/fmax lowering
arsenm Jun 25, 2024
4f80f36
AMDGPU: Add new metadata and expand atomicrmw fadd expansion tests
arsenm Jun 25, 2024
e214ed9
[libc][arm] move setjmp+longjmp to fullbuild-only entrypoints (#96708)
nickdesaulniers Jun 25, 2024
847235b
[RISCV] Add DAG combine to turn (sub (shl X, 8), X) into orc.b (#96680)
topperc Jun 25, 2024
0d53366
APFloat: Add minimumnum and maximumnum (#96304)
wzssyqa Jun 25, 2024
54ca5a8
[libc][fixedvector] Add const_iterator begin/end (#96714)
PiJoules Jun 25, 2024
e6c2216
Add ConstantRangeList::unionWith() and ::intersectWith() (#96547)
haopliu Jun 25, 2024
6b29965
[clang-tidy] align all help message in run-clang-tidy (#96199)
HerrCai0907 Jun 26, 2024
ef86451
[mlir] Remove `*` from generic Linalg/Vector rules in CODEOWNERS (#96…
dcaballe Jun 26, 2024
a4fef26
[X86][FixupSetCC] Substitute setcc + zext pair with setzucc if possib…
KanRobert Jun 26, 2024
55e60c3
[flang] Fix BoxAnalyzer.h for LLVM_ENABLE_EXPENSIVE_CHECKS (#96675)
dpalermo Jun 26, 2024
811e505
[llvm][CodeGen] Update checking method of loop-carried phi in window …
huaatian Jun 26, 2024
0fe82ea
gn build: Add missing source files to hwasan library.
pcc Jun 26, 2024
96b1f88
[ORC] Remove redundant locking of session mutex, update method name.
lhames Jun 15, 2024
896dd32
[ORC] Fix block dependence calculation in ObjectLinkingLayer.
lhames Jun 26, 2024
b347a72
[MC/DC][Coverage] Make tvbitmapupdate capable of atomic write (#96042)
chapuni Jun 26, 2024
f71f95d
[ELF] Rename IsRela to HasAddend
MaskRay Jun 26, 2024
174d353
[X86] Fix description for pass X86FixupSetCC.cpp, NFCI
KanRobert Jun 26, 2024
21bfc0e
Revert "[X86] Fix description for pass X86FixupSetCC.cpp, NFCI"
KanRobert Jun 26, 2024
9253ac2
[X86] Fix description for pass X86FixupSetCC.cpp, NFCI
KanRobert Jun 26, 2024
89d8df1
CodeGen, IR: Add target-{cpu,features} attributes to functions create…
pcc Jun 26, 2024
35f7b60
[AMDGPU] Extend permlane16, permlanex16 and permlane64 intrinsic lowe…
vikramRH Jun 26, 2024
cb3469a
Removed header and validated on new windows machine (#96724)
Jlalond Jun 26, 2024
2ee2b6a
[Coroutines] Clear FirstVLALoc in time
ChuanqiXu9 Jun 26, 2024
c01ce79
[mlir][Transforms][NFC] Dialect Conversion: Move argument materializa…
matthias-springer Jun 26, 2024
a159b36
Bufferization with ControlFlow Asserts (#95868)
mccowanzhang Jun 26, 2024
1822e31
[WebAssembly] Rename target-features.ll (#96716)
aheejin Jun 26, 2024
8426586
Revert "Reformat", due to wrong version of clang-format.
chapuni Jun 26, 2024
20683de
[X86][CodeGen] Not promote some binary ops from i16 to i32 if we have…
KanRobert Jun 26, 2024
eaae63d
"[libc++] Try again LWG3233 Broken requirements for shared_ptr conver…
huixie90 Jun 26, 2024
569faa4
Add documentation for `__builtin_object_size`. (#96573)
zygoloid Jun 26, 2024
28a3fbb
[X86][test] Pre-commit test for disabling NDD2NonNDD transform for 8/…
KanRobert Jun 26, 2024
6c4c44b
[SetOperations] Support set containers with remove_if (#96613)
nikic Jun 26, 2024
515e048
[InstCombine] Simplify commutative matchers (NFC) (#96665)
AZero13 Jun 26, 2024
dbd0c03
[MC][DWARF][NFC] Drop CIEKey map (#96075)
aengelke Jun 26, 2024
54cb5ca
[libc++][NFC] Simplify pair a bit (#96165)
philnik777 Jun 26, 2024
be00190
[TII][X86] Do not schedule frame-setup/frame-destory instructions (#9…
HaohaiWen Jun 26, 2024
1dbc2aa
[PassBuilder] Parse machine function analyses inside require/invalida…
paperchalice Jun 26, 2024
8681bb8
[LV] Add additional test coverage for cost modeling.
fhahn Jun 26, 2024
5b4000d
[VectorUtils] Add llvm::scaleShuffleMaskElts wrapper for narrowShuffl…
RKSimon Jun 26, 2024
90e4eb8
[clang][Interp][NFC] Add more source ranges to diagnostics
tbaederr Jun 25, 2024
f782ff8
[C API] Add getters for Target Extension Types to C API (#96447)
Benjins Jun 26, 2024
16f3492
[libc++] restrict the expected conversion constructor not compete aga…
huixie90 Jun 26, 2024
cca4f54
[InstCombine][X86] Avoid repeated getType() calls when folding blendv…
RKSimon Jun 26, 2024
c9fc960
[CLANG][LLVM][AArch64]SME2.1 intrinsics for MOVAZ tile to 2/4 vectors…
CarolineConcatto Jun 26, 2024
847d046
[clang][Interp] Fix comparing one-past-the-end pointers
tbaederr Jun 26, 2024
d6f906e
[SlotIndexes] Use simple_ilist instead of ilist. NFC. (#96747)
jayfoad Jun 26, 2024
86860be
[libc] Make 'rand()' thread-safe using atomics instead of TLS (#96692)
jhuber6 Jun 26, 2024
317f782
[ADT] Return bool from SmallPtrSet::remove_if()
nikic Jun 26, 2024
92715cf
[DAG] expandAVG - attempt to extend to a wider integer type for the a…
RKSimon Jun 26, 2024
6f582b7
[SLP][NFC]Remove extra check for VU.
alexey-bataev Jun 26, 2024
4c9b71d
[GlobalISel][ARM] Legalze set_fpmode and get_fpmode (#96467)
spavloff Jun 26, 2024
b6a94b6
[OMPIRBuilder] Use SmallPtrSet::remove_if() (NFC)
nikic Jun 26, 2024
519dbc6
[WasmExceptionInfo] Use SmallPtrSet::remove_if() (NFC)
nikic Jun 26, 2024
177cbd1
[Clang][SME2.1] Add REQUIRES: aarch64-registered-target to test
CarolineConcatto Jun 26, 2024
d42b392
[VectorUtils] Use SmallPtrSet::remove_if() (NFC)
nikic Jun 26, 2024
6f8efc7
[PhaseOrdering][X86] Regenerate pr67803.ll
RKSimon Jun 26, 2024
dfe80a7
[clang][dataflow] Teach `AnalysisASTVisitor` that `typeid()` can be e…
martinboehme Jun 26, 2024
85f47fd
[clang][nullability] Improve modeling of `++`/`--` operators. (#96601)
martinboehme Jun 26, 2024
e24a212
[clang][OpenMP] Shorten directive classification in ParseOpenMP (#94691)
kparzysz Jun 26, 2024
352a836
[InstCombine] Canonicalize non-i8 gep of mul to i8 (#96606)
davemgreen Jun 26, 2024
b6240c3
[AArch64] Add ability to list extensions enabled for a target (#95805)
pratlucas Jun 26, 2024
00ae6bb
[ARM] Regenerate MIR test (NFC)
nikic Jun 26, 2024
6e96e5a
Revert "[clang][dataflow] Teach `AnalysisASTVisitor` that `typeid()` …
martinboehme Jun 26, 2024
e5e0d87
[clang][Interp] Implement CXXStdInitializerListExprs
tbaederr Jun 26, 2024
161e168
[llvm-readobj][ELF] Test multivalued rpath entries and alter the outp…
feg208 Jun 26, 2024
b579aac
Revert "[AArch64] Add ability to list extensions enabled for a target…
pratlucas Jun 26, 2024
dead8f8
[clang][Interp] Pointers into unknown-size arrays are never one-past-end
tbaederr Jun 26, 2024
019f525
[clang-tidy] add fixhint for misc-use-internal-linkage (#96203)
HerrCai0907 Jun 26, 2024
fd35a92
[lldb] fix(lldb/**.py): fix comparison to True/False (#94039)
e-kwsm Jun 26, 2024
5861145
[lldb] fix(lldb/**.py): fix comparison to None (#94017)
e-kwsm Jun 26, 2024
3e1ebd7
[GlobalISel] Add support for lowering byref attribute
darkbuck Jun 26, 2024
30b93db
[Memprof] Adds the option to collect AccessCountHistograms for mempro…
mattweingarten Jun 26, 2024
17eaa23
[AMDGPU] MCExpr-ify AMDGPU HSAMetadata (#94788)
JanekvO Jun 26, 2024
b7762f2
Bump required minimal Z3 version from 4.7.1 to 4.8.9 (#96682)
steakhal Jun 26, 2024
2612765
[NVPTX] Fold (add (select 0, (mul a, b)), c) -> (select c, (mad a, b,…
AlexMaclean Jun 26, 2024
1918369
[RISCV][GISel] Add missing fclass tests. NFC (#96694)
topperc Jun 26, 2024
d0527ab
[libc] Fix Fuscia builder failing on atomic warnings (#96791)
jhuber6 Jun 26, 2024
637b7f8
[libc++] Classify iota_view precondition (#96662)
ldionne Jun 26, 2024
e1015ae
[libc][docs] List `rand` and `srand` as supported on the GPU (#96757)
jhuber6 Jun 26, 2024
acd6cb8
[RISCV][GISel] Support fcmp and fclass for Zfh. (#96696)
topperc Jun 26, 2024
7c4fc9c
[libc][fcntl] Simplify the handling of the return value from syscall …
simonzgx Jun 26, 2024
581fd2f
[RISCV][GISel] Support fptoi and itofp for Zfh. (#96707)
topperc Jun 26, 2024
0d9172e
[SPIR-V] Improve pattern matching and tracking of constant integers (…
VyacheslavLevytskyy Jun 26, 2024
bb50bc2
[SPIR-V] Fix support of OpGenericCastToPtr __spirv_ wrappers (#96655)
VyacheslavLevytskyy Jun 26, 2024
378630b
[SPIR-V] Support cl_ext_float_atomics and fix errors in definition of…
VyacheslavLevytskyy Jun 26, 2024
8ab6677
[RISCV] Add Zfh to typeIsScalarFPArith and simplify code. NFC
topperc Jun 26, 2024
868fae1
[RISCV][GISel] Support G_FPEXT/FPTRUNC with ZFh.
topperc Jun 26, 2024
141bea8
[DirectX] Add stub PSV0 section (#96712)
damyanp Jun 26, 2024
0f24a46
[llvm-config] Make llvm-config --system-libs obey LLVM_USE_STATIC_ZST…
khuey Jun 26, 2024
3f78d89
[TypeProf][InstrFDO]Omit vtable symbols in indexed profiles by defaul…
mingmingl-llvm Jun 26, 2024
d29fdfb
[LTO] Avoid assert fail on failed pass plugin load (#96691)
jdenny-ornl Jun 26, 2024
d058b51
Revert "[AArch64] Lower extending sitofp using tbl (#92528)"
momchil-velikov Jun 26, 2024
22b36bf
[Memprof] Fix a warning
kazutakahirata Jun 26, 2024
6b1c51b
[OpenMP] Migrate GPU Reductions CodeGen from Clang to OMPIRBuilder (#…
TIFitis Jun 26, 2024
a89a541
Fix test introduced by d29fdfbc4e3b
jdenny-ornl Jun 26, 2024
133492f
[libc] add proxy header for struct_sigaction (#96224)
SchrodingerZhu Jun 26, 2024
57d3d07
[libc] added newhdrgen class implementation (#96710)
RoseZhang03 Jun 26, 2024
a54704d
[WebAssembly] Split and tidy up target features test (#96735)
aheejin Jun 26, 2024
eeb9316
[runtimes][NFC] Improve error message when running Lit incorrectly
ldionne Jun 26, 2024
2582d11
[ADT] Always use 32-bit size type for SmallVector with 16-bit element…
jayfoad Jun 26, 2024
2f57df5
[CodeGen] Fix a warning
kazutakahirata Jun 26, 2024
49e5cd2
[LV][NFC] Marked functions as const. Added LLVM_DEBUG. (#96681)
nikolaypanchenko Jun 26, 2024
02b57de
[libc] NVPTX Profiling (#92009)
jameshu15869 Jun 26, 2024
0c56fd0
[SimplifyCFG] Forward indirect switch condition value if it can help …
dianqk Jun 26, 2024
1abe22c
Revert "[ADT] Always use 32-bit size type for SmallVector with 16-bit…
chelcassanova Jun 26, 2024
0d27d54
add timing utils for amdgpu
jameshu15869 Jun 15, 2024
2a6f15d
correctly store input arguments into registers
jameshu15869 Jun 30, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
3 changes: 3 additions & 0 deletions .git-blame-ignore-revs
Original file line number Diff line number Diff line change
Expand Up @@ -81,3 +81,6 @@ f6d557ee34b6bbdb1dc32f29e34b4a4a8ad35e81

# [NFC] clang-format utils/TableGen (#80973)
b9079baaddfed5e604fbfaa1d81a7a1c38e78c26

# [libc++][NFC] Run clang-format on libcxx/include again (#95874)
e2c2ffbe7a1b5d9e32a2ce64279475b50c4cba5b
13 changes: 9 additions & 4 deletions .github/CODEOWNERS
Original file line number Diff line number Diff line change
Expand Up @@ -64,8 +64,8 @@ clang/test/AST/Interp/ @tbaederr
/mlir/Dialect/*/Transforms/Bufferize.cpp @matthias-springer

# Linalg Dialect in MLIR.
/mlir/include/mlir/Dialect/Linalg/* @dcaballe @nicolasvasilache @rengolin
/mlir/lib/Dialect/Linalg/* @dcaballe @nicolasvasilache @rengolin
/mlir/include/mlir/Dialect/Linalg @dcaballe @nicolasvasilache @rengolin
/mlir/lib/Dialect/Linalg @dcaballe @nicolasvasilache @rengolin
/mlir/lib/Dialect/Linalg/Transforms/DecomposeLinalgOps.cpp @MaheshRavishankar @nicolasvasilache
/mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp @MaheshRavishankar @nicolasvasilache
/mlir/lib/Dialect/Linalg/Transforms/ElementwiseOpFusion.cpp @MaheshRavishankar @nicolasvasilache
Expand All @@ -85,8 +85,8 @@ clang/test/AST/Interp/ @tbaederr
/mlir/**/*VectorToSCF* @banach-space @dcaballe @matthias-springer @nicolasvasilache
/mlir/**/*VectorToLLVM* @banach-space @dcaballe @nicolasvasilache
/mlir/**/*X86Vector* @aartbik @dcaballe @nicolasvasilache
/mlir/include/mlir/Dialect/Vector/* @dcaballe @nicolasvasilache
/mlir/lib/Dialect/Vector/* @dcaballe @nicolasvasilache
/mlir/include/mlir/Dialect/Vector @dcaballe @nicolasvasilache
/mlir/lib/Dialect/Vector @dcaballe @nicolasvasilache
/mlir/lib/Dialect/Vector/Transforms/* @hanhanW @nicolasvasilache
/mlir/lib/Dialect/Vector/Transforms/VectorEmulateNarrowType.cpp @MaheshRavishankar @nicolasvasilache
/mlir/**/*EmulateNarrowType* @dcaballe @hanhanW
Expand Down Expand Up @@ -141,3 +141,8 @@ clang/test/AST/Interp/ @tbaederr

# ExtractAPI
/clang/**/ExtractAPI @daniel-grumberg

# DWARFLinker, dwarfutil, dsymutil
/llvm/**/DWARFLinker/ @JDevlieghere
/llvm/**/dsymutil/ @JDevlieghere
/llvm/**/llvm-dwarfutil/ @JDevlieghere
2 changes: 1 addition & 1 deletion .github/workflows/pr-code-format.yml
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ jobs:
- name: Install clang-format
uses: aminya/setup-cpp@v1
with:
clangformat: 18.1.1
clangformat: 18.1.7

- name: Setup Python env
uses: actions/setup-python@v5
Expand Down
104 changes: 104 additions & 0 deletions .github/workflows/release-sources.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,104 @@
name: Release Sources

permissions:
contents: read

on:
workflow_dispatch:
inputs:
release-version:
description: Release Version
required: true
type: string
workflow_call:
inputs:
release-version:
description: Release Version
required: true
type: string
# Run on pull_requests for testing purposes.
pull_request:
paths:
- '.github/workflows/release-sources.yml'
types:
- opened
- synchronize
- reopened
# When a PR is closed, we still start this workflow, but then skip
# all the jobs, which makes it effectively a no-op. The reason to
# do this is that it allows us to take advantage of concurrency groups
# to cancel in progress CI jobs whenever the PR is closed.
- closed

concurrency:
group: ${{ github.workflow }}-${{ inputs.release-version || github.event.pull_request.number }}
cancel-in-progress: True

jobs:
inputs:
name: Collect Job Inputs
if: >-
github.repository_owner == 'llvm' &&
github.event.action != 'closed'
outputs:
ref: ${{ steps.inputs.outputs.ref }}
export-args: ${{ steps.inputs.outputs.export-args }}
runs-on: ubuntu-latest
steps:
- id: inputs
run: |
ref=${{ inputs.release-version || github.sha }}
if [ -n "${{ inputs.release-version }}" ]; then
export_args="-release ${{ inputs.release-version }} -final"
else
export_args="-git-ref ${{ github.sha }}"
fi
echo "ref=$ref" >> $GITHUB_OUTPUT
echo "export-args=$export_args" >> $GITHUB_OUTPUT

release-sources:
name: Package Release Sources
if: github.repository_owner == 'llvm'
runs-on: ubuntu-latest
needs:
- inputs
permissions:
id-token: write
attestations: write
steps:
- name: Checkout LLVM
uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4.1.1
with:
ref: ${{ needs.inputs.outputs.ref }}
fetch-tags: true
- name: Install Dependencies
run: |
pip install --require-hashes -r ./llvm/utils/git/requirements.txt

- name: Check Permissions
if: github.event_name != 'pull_request'
env:
GITHUB_TOKEN: ${{ github.token }}
USER_TOKEN: ${{ secrets.RELEASE_TASKS_USER_TOKEN }}
run: |
./llvm/utils/release/./github-upload-release.py --token "$GITHUB_TOKEN" --user ${{ github.actor }} --user-token "$USER_TOKEN" check-permissions
- name: Create Tarballs
run: |
./llvm/utils/release/export.sh ${{ needs.inputs.outputs.export-args }}
- name: Attest Build Provenance
if: github.event_name != 'pull_request'
id: provenance
uses: actions/attest-build-provenance@897ed5eab6ed058a474202017ada7f40bfa52940 # v1.0.0
with:
subject-path: "*.xz"
- if: github.event_name != 'pull_request'
run: |
mv ${{ steps.provenance.outputs.bundle-path }} .
- name: Create Tarball Artifacts
uses: actions/upload-artifact@65462800fd760344b1a7b4382951275a0abb4808 #v4.3.3
with:
path: |
*.xz
attestation.jsonl


11 changes: 11 additions & 0 deletions .github/workflows/release-tasks.yml
Original file line number Diff line number Diff line change
Expand Up @@ -85,3 +85,14 @@ jobs:
with:
release-version: ${{ needs.validate-tag.outputs.release-version }}
upload: true

release-sources:
name: Package Release Sources
permissions:
id-token: write
attestations: write
needs:
- validate-tag
uses: ./.github/workflows/release-sources.yml
with:
release-version: ${{ needs.validate-tag.outputs.release-version }}
83 changes: 77 additions & 6 deletions bolt/docs/CommandLineArgumentReference.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,14 @@

Allow processing of stripped binaries

- `--alt-inst-feature-size=<uint>`

Size of feature field in .altinstructions

- `--alt-inst-has-padlen`

Specify that .altinstructions has padlen field

- `--asm-dump[=<dump folder>]`

Dump function into assembly
Expand All @@ -78,6 +86,16 @@
in the input is decoded and re-encoded. If the resulting bytes do not match
the input, a warning message is printed.

- `--comp-dir-override=<string>`

Overrides DW_AT_comp_dir, and provides an alterantive base location, which is
used with DW_AT_dwo_name to construct a path to *.dwo files.

- `--create-debug-names-section`

Creates .debug_names section, if the input binary doesn't have it already, for
DWARF5 CU/TUs.

- `--cu-processing-batch-size=<uint>`

Specifies the size of batches for processing CUs. Higher number has better
Expand All @@ -93,7 +111,7 @@

- `--debug-skeleton-cu`

Prints out offsetrs for abbrev and debu_info of Skeleton CUs that get patched.
Prints out offsets for abbrev and debug_info of Skeleton CUs that get patched.

- `--deterministic-debuginfo`

Expand All @@ -104,6 +122,10 @@

Add basic block instructions as tool tips on nodes

- `--dump-alt-instructions`

Dump Linux alternative instructions info

- `--dump-cg=<string>`

Dump callgraph to the given file
Expand All @@ -117,10 +139,34 @@
Dump function CFGs to graphviz format after each stage;enable '-print-loops'
for color-coded blocks

- `--dump-linux-exceptions`

Dump Linux kernel exception table

- `--dump-orc`

Dump raw ORC unwind information (sorted)

- `--dump-para-sites`

Dump Linux kernel paravitual patch sites

- `--dump-pci-fixups`

Dump Linux kernel PCI fixup table

- `--dump-smp-locks`

Dump Linux kernel SMP locks

- `--dump-static-calls`

Dump Linux kernel static calls

- `--dump-static-keys`

Dump Linux kernel static keys jump table

- `--dwarf-output-path=<string>`

Path to where .dwo files or dwp file will be written out to.
Expand Down Expand Up @@ -205,6 +251,14 @@

Skip processing of cold functions

- `--log-file=<string>`

Redirect journaling to a file instead of stdout/stderr

- `--long-jump-labels`

Always use long jumps/nops for Linux kernel static keys

- `--max-data-relocations=<uint>`

Maximum number of data relocations to process
Expand Down Expand Up @@ -274,6 +328,10 @@

Number of tasks to be created per thread

- `--terminal-trap`

Assume that execution stops at trap instruction

- `--thread-count=<uint>`

Number of threads
Expand Down Expand Up @@ -618,10 +676,6 @@
threshold means fewer functions to process. E.g threshold of 90 means only top
10 percent of functions with profile will be processed.

- `--mcf-use-rarcs`

In MCF, consider the possibility of cancelling flow to balance edges

- `--memcpy1-spec=<func1,func2:cs1:cs2,func3:cs1,...>`

List of functions with call sites for which to specialize memcpy() for size 1
Expand Down Expand Up @@ -710,7 +764,7 @@
- `none`: do not reorder functions
- `exec-count`: order by execution count
- `hfsort`: use hfsort algorithm
- `hfsort+`: use hfsort+ algorithm
- `hfsort+`: use cache-directed sort
- `cdsort`: use cache-directed sort
- `pettis-hansen`: use Pettis-Hansen algorithm
- `random`: reorder functions randomly
Expand Down Expand Up @@ -802,6 +856,11 @@

The maximum size of a function to consider for inference.

- `--stale-matching-min-matched-block=<uint>`

Percentage threshold of matched basic blocks at which stale profile inference
is executed.

- `--stale-threshold=<uint>`

Maximum percentage of stale functions to tolerate (default: 100)
Expand Down Expand Up @@ -848,6 +907,10 @@

Only apply branch boundary alignment in hot code

- `--x86-strip-redundant-address-size`

Remove redundant Address-Size override prefix

### BOLT options in relocation mode:

- `--align-macro-fusion=<value>`
Expand Down Expand Up @@ -1034,6 +1097,10 @@

Print clusters

- `--print-estimate-edge-counts`

Print function after edge counts are set for no-LBR profile

- `--print-finalized`

Print function after CFG is finalized
Expand Down Expand Up @@ -1066,6 +1133,10 @@

Print functions after inlining optimization

- `--print-large-functions`

Print functions that could not be overwritten due to excessive size

- `--print-longjmp`

Print functions after longjmp pass
Expand Down
Loading