57 commits
f29bfc1
Added cmake logic to install components into rocm-bin
estewart08 Jul 15, 2020
7a97056
Merge branch 'amd-stg-openmp' into aomp-epsdb
estewart08 Aug 10, 2020
b85f26c
Merge branch 'amd-stg-openmp' into aomp-epsdb
estewart08 Sep 3, 2020
b277085
Removed utils for UC.
estewart08 Sep 4, 2020
c21f25e
PR 31 to cleanup libm build for aomp
ronlieb Oct 13, 2020
a627766
PR 31 to cleanup libm build for aomp
ronlieb Oct 13, 2020
29eaee6
Cleanup cmake by removing hip-device-lib-path and adding nogpuinc to …
estewart08 Oct 29, 2020
d6827a4
slimmed down aompcc
ronlieb Dec 1, 2020
cc6154e
Add soft link for aompcc for UC build. (#35)
estewart08 Dec 3, 2020
7d27bde
Add missing ')'
estewart08 Dec 3, 2020
0420b8a
Merge branch 'amd-stg-openmp' into aomp-epsdb
estewart08 Dec 8, 2020
d8faffd
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Dec 9, 2020
9c3a4ae
Install utils into bin which will ultimately land in llvm/bin.
estewart08 Jan 13, 2021
e3cdc71
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Jan 13, 2021
7d3626e
Added back support for -version and --version
dhruvachak Feb 5, 2021
c9dd137
Merge pull request #36 from ROCm-Developer-Tools/dhruva_aompcc_versio…
dhruvachak Feb 9, 2021
f76c3be
Change hardcoded version number to generic
dhruvachak Feb 9, 2021
70eabbc
Merge pull request #37 from ROCm-Developer-Tools/dhruva_fix_hardcoded…
ronlieb Feb 9, 2021
f2f4fde
Merge branch 'amd-stg-openmp' into aomp-epsdb
estewart08 Feb 24, 2021
0aa6595
Add f90print f90printi f90printf f90printd routines
ronlieb Feb 27, 2021
2f321ea
Add linking of ocml.bc into libm to resolve undefined symbols.
estewart08 Mar 17, 2021
ccd2930
Add message to print out AMDDeviceLibs_DIR location.
estewart08 Mar 18, 2021
6d8e844
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Apr 15, 2021
7c66cf0
Fix for aomp13 an cov4
ronlieb May 4, 2021
93f4bd7
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 May 5, 2021
f820ba2
Add f90printl to do long (int64)
ronlieb May 5, 2021
9afbd7b
remove the build of libm from extras and move it to llvm-project/open…
gregrodgers May 18, 2021
e7a82d1
Merge commit 'f820ba26a4bdb886' from aomp-epsdb into aomp-epsdb-mainline
estewart08 Jun 10, 2021
d949120
Change spelling fs::F_None to fs::OF_None
ronlieb Jun 17, 2021
d9be85c
Add gfx1030 to gputable.txt and mygpu.
estewart08 Jul 16, 2021
c2bd38b
Updates to gputable.txt and mygpu.
estewart08 Jul 22, 2021
7c0fef8
Cherry-pick changes for mygpu.
estewart08 Jul 22, 2021
cd310aa
merge aomp-epsdb into aomp-epsdb-mainline
Aug 3, 2021
7a84cd1
Add additional cards to gpulist/mymcpu.
estewart08 Aug 9, 2021
d1f411a
Cherry-pick more changes for mygpu.
estewart08 Aug 10, 2021
807a423
merge aomp-epsdb into aomp-espdb-mainline
Sep 24, 2021
a14b11e
merge aomp-epsdb into aomp-epsdb-mainline
Dec 13, 2021
9184c6a
Merge branch 'aomp-dev' into aomp-epsdb
estewart08 Mar 22, 2022
b458abe
Add installation of gpurun and openmpi_cu_mask for ROCm.
estewart08 Mar 22, 2022
87ef298
merge aomp-epsdb into aomp-espdb-mainline
Mar 22, 2022
bd38e1e
Fix faulty merge conflict resolution.
estewart08 Mar 28, 2022
576676b
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Mar 28, 2022
55baa1b
ROCm 5.3 package reorg. (#44) (#45)
estewart08 Jun 1, 2022
2cdac1b
Merge branch 'aomp-dev' into aomp-epsdb
estewart08 Jun 12, 2022
f3f027a
Cherry-pick copyright changes to mainline.
estewart08 Jun 16, 2022
98210a4
ROCm 5.3 package reorg. (#44) (#45)
estewart08 Jun 1, 2022
b354f94
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Jun 27, 2022
f70bbb7
Merge branch 'aomp-dev' into aomp-epsdb
estewart08 Aug 3, 2022
1cbc7c6
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Aug 3, 2022
cb4354c
Merge branch 'aomp-dev' into aomp-epsdb
ronlieb Sep 10, 2022
eb6a388
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Sep 14, 2022
eff9b44
Merge branch 'aomp-dev' into aomp-epsdb
estewart08 Oct 5, 2022
b1e0402
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
estewart08 Oct 6, 2022
6c625da
Move copyright and MIT to near top of file, cause palameda wants it
ronlieb Oct 17, 2022
11a3fdd
Revise license per suggestion from Jeff O
ronlieb Oct 19, 2022
2dbefc3
Merge branch 'aomp-epsdb' into aomp-epsdb-mainline
ronlieb Oct 19, 2022
f08356a
[gpurun] - Add init for NUMANODE if null.
estewart08 Mar 6, 2023
25 changes: 25 additions & 0 deletions aomp-device-libs/aompextras/src/cprint.c
@@ -0,0 +1,25 @@
// compile with
// /home/rlieberm/rocm/aomp_13.0-2/bin/clang -O2 -target x86_64-pc-linux-gnu -fopenmp -fopenmp-targets=amdgcn-amd-amdhsa -Xopenmp-target=amdgcn-amd-amdhsa -march=gfx906 -c cprint.c -emit-llvm -o cprint.bc -save-temps

// dump the ll
// cp cprint-openmp-amdgcn-amd-amdhsa-gfx906.tmp.ll ~/git/aomp12/aomp-extras/aomp-device-libs/aompextras/src/cprint.ll


#include <stdio.h>
#pragma omp declare target
void f90print_(char *s) {
  printf("%s\n", s);
}
void f90printi_(char *s, int *i) {
  printf("%s %d\n", s, *i);
}
void f90printl_(char *s, long *i) {
  printf("%s %ld\n", s, *i);
}
void f90printf_(char *s, float *f) {
  printf("%s %f\n", s, *f);
}
void f90printd_(char *s, double *d) {
  printf("%s %g\n", s, *d);
}
#pragma omp end declare target
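Review note: the trailing underscore and the by-reference arguments look like the usual convention for C routines meant to be called from Fortran, which fits the commit message "Add f90print f90printi f90printf f90printd routines". The sketch below is a host-only stand-in for checking what text each format string produces. The `demo_*` names and the buffer parameters are hypothetical and are not part of this PR; only the format strings are taken from cprint.c.

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical host-side stand-ins (not in the PR). They reuse the format
   strings from cprint.c but write into a caller-supplied buffer so the
   resulting text can be inspected. Each returns what snprintf returns. */

/* Same format as f90printi_: label, space, int passed by reference. */
int demo_f90printi(char *out, size_t cap, const char *s, const int *i) {
    return snprintf(out, cap, "%s %d\n", s, *i);
}

/* Same format as f90printf_: the float is promoted to double for %f. */
int demo_f90printf(char *out, size_t cap, const char *s, const float *f) {
    return snprintf(out, cap, "%s %f\n", s, *f);
}

/* Same format as f90printd_: note %g here, whereas f90printf_ uses %f. */
int demo_f90printd(char *out, size_t cap, const char *s, const double *d) {
    return snprintf(out, cap, "%s %g\n", s, *d);
}
```

One thing this makes visible: `f90printf_` uses `%f` while `f90printd_` uses `%g`, so the same value is rendered differently depending on which routine is called (for example `0.500000` versus `0.5`).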
277 changes: 277 additions & 0 deletions aomp-device-libs/aompextras/src/cprint.ll
@@ -0,0 +1,277 @@
; ModuleID = 'cprint-openmp-amdgcn-amd-amdhsa-gfx906.tmp.bc'
source_filename = "cprint.c"
target datalayout = "e-p:64:64-p1:64:64-p2:32:32-p3:32:32-p4:64:64-p5:32:32-p6:32:32-i64:64-v16:16-v24:32-v32:32-v48:64-v96:128-v192:256-v256:256-v512:512-v1024:1024-v2048:2048-n32:64-S32-A5-G1-ni:7"
target triple = "amdgcn-amd-amdhsa"

@.str.1 = private unnamed_addr addrspace(4) constant [7 x i8] c"%s %d\0A\00", align 1
@.str.3 = private unnamed_addr addrspace(4) constant [7 x i8] c"%s %f\0A\00", align 1
@.str.4 = private unnamed_addr addrspace(4) constant [7 x i8] c"%s %g\0A\00", align 1

; Function Attrs: alwaysinline norecurse nounwind
define void @f90print_(i8* %s) local_unnamed_addr #0 {
entry:
%0 = tail call i32 @__strlen_max(i8* %s, i32 1024) #2
%total_buffer_size = add i32 %0, 28
%1 = tail call i8* @printf_allocate(i32 %total_buffer_size) #2
%2 = bitcast i8* %1 to i32*
%3 = addrspacecast i32* %2 to i32 addrspace(1)*
store i32 24, i32 addrspace(1)* %3, align 4
%4 = getelementptr inbounds i8, i8* %1, i64 4
%5 = bitcast i8* %4 to i32*
%6 = addrspacecast i32* %5 to i32 addrspace(1)*
store i32 2, i32 addrspace(1)* %6, align 4
%7 = getelementptr inbounds i8, i8* %1, i64 8
%8 = bitcast i8* %7 to i32*
%9 = addrspacecast i32* %8 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %9, align 4
%10 = getelementptr inbounds i8, i8* %1, i64 12
%11 = bitcast i8* %10 to i32*
%12 = addrspacecast i32* %11 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %12, align 4
%13 = getelementptr inbounds i8, i8* %1, i64 16
%14 = bitcast i8* %13 to i32*
%15 = addrspacecast i32* %14 to i32 addrspace(1)*
store i32 4, i32 addrspace(1)* %15, align 4
%16 = getelementptr inbounds i8, i8* %1, i64 20
%17 = bitcast i8* %16 to i32*
%18 = addrspacecast i32* %17 to i32 addrspace(1)*
store i32 %0, i32 addrspace(1)* %18, align 4
%19 = getelementptr inbounds i8, i8* %1, i64 24
%20 = bitcast i8* %19 to i32*
%21 = addrspacecast i32* %20 to i32 addrspace(1)*
store i32 684837, i32 addrspace(1)* %21, align 1
%22 = getelementptr inbounds i8, i8* %1, i64 28
%23 = addrspacecast i8* %22 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* align 1 %23, i8* align 1 %s, i32 %0, i1 false)
%24 = tail call i32 @printf_execute(i8* %1, i32 %total_buffer_size) #2
ret void
}

declare i32 @__strlen_max(i8*, i32) local_unnamed_addr

declare i8* @printf_allocate(i32) local_unnamed_addr

; Function Attrs: argmemonly nofree nosync nounwind willreturn
declare void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* noalias nocapture writeonly, i8* noalias nocapture readonly, i32, i1 immarg) #1

declare i32 @printf_execute(i8*, i32) local_unnamed_addr

; Function Attrs: alwaysinline norecurse nounwind
define void @f90printi_(i8* %s, i32* nocapture readonly %i) local_unnamed_addr #0 {
entry:
%0 = load i32, i32* %i, align 4, !tbaa !4
%1 = tail call i32 @__strlen_max(i8* %s, i32 1024) #2
%total_buffer_size = add i32 %1, 39
%2 = tail call i8* @printf_allocate(i32 %total_buffer_size) #2
%3 = bitcast i8* %2 to i32*
%4 = addrspacecast i32* %3 to i32 addrspace(1)*
store i32 32, i32 addrspace(1)* %4, align 4
%5 = getelementptr inbounds i8, i8* %2, i64 4
%6 = bitcast i8* %5 to i32*
%7 = addrspacecast i32* %6 to i32 addrspace(1)*
store i32 3, i32 addrspace(1)* %7, align 4
%8 = getelementptr inbounds i8, i8* %2, i64 8
%9 = bitcast i8* %8 to i32*
%10 = addrspacecast i32* %9 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %10, align 4
%11 = getelementptr inbounds i8, i8* %2, i64 12
%12 = bitcast i8* %11 to i32*
%13 = addrspacecast i32* %12 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %13, align 4
%14 = getelementptr inbounds i8, i8* %2, i64 16
%15 = bitcast i8* %14 to i32*
%16 = addrspacecast i32* %15 to i32 addrspace(1)*
store i32 852000, i32 addrspace(1)* %16, align 4
%17 = getelementptr inbounds i8, i8* %2, i64 20
%18 = bitcast i8* %17 to i32*
%19 = addrspacecast i32* %18 to i32 addrspace(1)*
store i32 7, i32 addrspace(1)* %19, align 4
%20 = getelementptr inbounds i8, i8* %2, i64 24
%21 = bitcast i8* %20 to i32*
%22 = addrspacecast i32* %21 to i32 addrspace(1)*
store i32 %1, i32 addrspace(1)* %22, align 4
%23 = getelementptr inbounds i8, i8* %2, i64 28
%24 = bitcast i8* %23 to i32*
%25 = addrspacecast i32* %24 to i32 addrspace(1)*
store i32 %0, i32 addrspace(1)* %25, align 4
%26 = getelementptr inbounds i8, i8* %2, i64 32
%27 = addrspacecast i8* %26 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p4i8.i64(i8 addrspace(1)* noundef align 1 dereferenceable(7) %27, i8 addrspace(4)* noundef align 1 dereferenceable(7) getelementptr inbounds ([7 x i8], [7 x i8] addrspace(4)* @.str.1, i64 0, i64 0), i64 7, i1 false)
%28 = getelementptr inbounds i8, i8* %2, i64 39
%29 = addrspacecast i8* %28 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* align 1 %29, i8* align 1 %s, i32 %1, i1 false)
%30 = tail call i32 @printf_execute(i8* %2, i32 %total_buffer_size) #2
ret void
}

; Function Attrs: alwaysinline norecurse nounwind
define void @f90printl_(i8* %s, i64* nocapture readonly %i) local_unnamed_addr #0 {
entry:
%0 = load i64, i64* %i, align 8, !tbaa !8
%1 = tail call i32 @__strlen_max(i8* %s, i32 1024) #2
%total_buffer_size = add i32 %1, 48
%2 = tail call i8* @printf_allocate(i32 %total_buffer_size) #2
%3 = bitcast i8* %2 to i32*
%4 = addrspacecast i32* %3 to i32 addrspace(1)*
store i32 40, i32 addrspace(1)* %4, align 4
%5 = getelementptr inbounds i8, i8* %2, i64 4
%6 = bitcast i8* %5 to i32*
%7 = addrspacecast i32* %6 to i32 addrspace(1)*
store i32 3, i32 addrspace(1)* %7, align 4
%8 = getelementptr inbounds i8, i8* %2, i64 8
%9 = bitcast i8* %8 to i32*
%10 = addrspacecast i32* %9 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %10, align 4
%11 = getelementptr inbounds i8, i8* %2, i64 12
%12 = bitcast i8* %11 to i32*
%13 = addrspacecast i32* %12 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %13, align 4
%14 = getelementptr inbounds i8, i8* %2, i64 16
%15 = bitcast i8* %14 to i32*
%16 = addrspacecast i32* %15 to i32 addrspace(1)*
store i32 852032, i32 addrspace(1)* %16, align 4
%17 = getelementptr inbounds i8, i8* %2, i64 20
%18 = bitcast i8* %17 to i32*
%19 = addrspacecast i32* %18 to i32 addrspace(1)*
store i32 8, i32 addrspace(1)* %19, align 4
%20 = getelementptr inbounds i8, i8* %2, i64 24
%21 = bitcast i8* %20 to i32*
%22 = addrspacecast i32* %21 to i32 addrspace(1)*
store i32 %1, i32 addrspace(1)* %22, align 4
%23 = getelementptr inbounds i8, i8* %2, i64 32
%24 = bitcast i8* %23 to i64*
%25 = addrspacecast i64* %24 to i64 addrspace(1)*
store i64 %0, i64 addrspace(1)* %25, align 8
%26 = getelementptr inbounds i8, i8* %2, i64 40
%27 = bitcast i8* %26 to i64*
%28 = addrspacecast i64* %27 to i64 addrspace(1)*
store i64 2925165409235749, i64 addrspace(1)* %28, align 1
%29 = getelementptr inbounds i8, i8* %2, i64 48
%30 = addrspacecast i8* %29 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* align 1 %30, i8* align 1 %s, i32 %1, i1 false)
%31 = tail call i32 @printf_execute(i8* %2, i32 %total_buffer_size) #2
ret void
}

; Function Attrs: alwaysinline norecurse nounwind
define void @f90printf_(i8* %s, float* nocapture readonly %f) local_unnamed_addr #0 {
entry:
%0 = load float, float* %f, align 4, !tbaa !10
%conv = fpext float %0 to double
%1 = tail call i32 @__strlen_max(i8* %s, i32 1024) #2
%total_buffer_size = add i32 %1, 47
%2 = tail call i8* @printf_allocate(i32 %total_buffer_size) #2
%3 = bitcast i8* %2 to i32*
%4 = addrspacecast i32* %3 to i32 addrspace(1)*
store i32 40, i32 addrspace(1)* %4, align 4
%5 = getelementptr inbounds i8, i8* %2, i64 4
%6 = bitcast i8* %5 to i32*
%7 = addrspacecast i32* %6 to i32 addrspace(1)*
store i32 3, i32 addrspace(1)* %7, align 4
%8 = getelementptr inbounds i8, i8* %2, i64 8
%9 = bitcast i8* %8 to i32*
%10 = addrspacecast i32* %9 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %10, align 4
%11 = getelementptr inbounds i8, i8* %2, i64 12
%12 = bitcast i8* %11 to i32*
%13 = addrspacecast i32* %12 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %13, align 4
%14 = getelementptr inbounds i8, i8* %2, i64 16
%15 = bitcast i8* %14 to i32*
%16 = addrspacecast i32* %15 to i32 addrspace(1)*
store i32 196672, i32 addrspace(1)* %16, align 4
%17 = getelementptr inbounds i8, i8* %2, i64 20
%18 = bitcast i8* %17 to i32*
%19 = addrspacecast i32* %18 to i32 addrspace(1)*
store i32 7, i32 addrspace(1)* %19, align 4
%20 = getelementptr inbounds i8, i8* %2, i64 24
%21 = bitcast i8* %20 to i32*
%22 = addrspacecast i32* %21 to i32 addrspace(1)*
store i32 %1, i32 addrspace(1)* %22, align 4
%23 = getelementptr inbounds i8, i8* %2, i64 32
%24 = bitcast i8* %23 to double*
%25 = addrspacecast double* %24 to double addrspace(1)*
store double %conv, double addrspace(1)* %25, align 8
%26 = getelementptr inbounds i8, i8* %2, i64 40
%27 = addrspacecast i8* %26 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p4i8.i64(i8 addrspace(1)* noundef align 1 dereferenceable(7) %27, i8 addrspace(4)* noundef align 1 dereferenceable(7) getelementptr inbounds ([7 x i8], [7 x i8] addrspace(4)* @.str.3, i64 0, i64 0), i64 7, i1 false)
%28 = getelementptr inbounds i8, i8* %2, i64 47
%29 = addrspacecast i8* %28 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* align 1 %29, i8* align 1 %s, i32 %1, i1 false)
%30 = tail call i32 @printf_execute(i8* %2, i32 %total_buffer_size) #2
ret void
}

; Function Attrs: alwaysinline norecurse nounwind
define void @f90printd_(i8* %s, double* nocapture readonly %d) local_unnamed_addr #0 {
entry:
%0 = load double, double* %d, align 8, !tbaa !12
%1 = tail call i32 @__strlen_max(i8* %s, i32 1024) #2
%total_buffer_size = add i32 %1, 47
%2 = tail call i8* @printf_allocate(i32 %total_buffer_size) #2
%3 = bitcast i8* %2 to i32*
%4 = addrspacecast i32* %3 to i32 addrspace(1)*
store i32 40, i32 addrspace(1)* %4, align 4
%5 = getelementptr inbounds i8, i8* %2, i64 4
%6 = bitcast i8* %5 to i32*
%7 = addrspacecast i32* %6 to i32 addrspace(1)*
store i32 3, i32 addrspace(1)* %7, align 4
%8 = getelementptr inbounds i8, i8* %2, i64 8
%9 = bitcast i8* %8 to i32*
%10 = addrspacecast i32* %9 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %10, align 4
%11 = getelementptr inbounds i8, i8* %2, i64 12
%12 = bitcast i8* %11 to i32*
%13 = addrspacecast i32* %12 to i32 addrspace(1)*
store i32 983041, i32 addrspace(1)* %13, align 4
%14 = getelementptr inbounds i8, i8* %2, i64 16
%15 = bitcast i8* %14 to i32*
%16 = addrspacecast i32* %15 to i32 addrspace(1)*
store i32 196672, i32 addrspace(1)* %16, align 4
%17 = getelementptr inbounds i8, i8* %2, i64 20
%18 = bitcast i8* %17 to i32*
%19 = addrspacecast i32* %18 to i32 addrspace(1)*
store i32 7, i32 addrspace(1)* %19, align 4
%20 = getelementptr inbounds i8, i8* %2, i64 24
%21 = bitcast i8* %20 to i32*
%22 = addrspacecast i32* %21 to i32 addrspace(1)*
store i32 %1, i32 addrspace(1)* %22, align 4
%23 = getelementptr inbounds i8, i8* %2, i64 32
%24 = bitcast i8* %23 to double*
%25 = addrspacecast double* %24 to double addrspace(1)*
store double %0, double addrspace(1)* %25, align 8
%26 = getelementptr inbounds i8, i8* %2, i64 40
%27 = addrspacecast i8* %26 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p4i8.i64(i8 addrspace(1)* noundef align 1 dereferenceable(7) %27, i8 addrspace(4)* noundef align 1 dereferenceable(7) getelementptr inbounds ([7 x i8], [7 x i8] addrspace(4)* @.str.4, i64 0, i64 0), i64 7, i1 false)
%28 = getelementptr inbounds i8, i8* %2, i64 47
%29 = addrspacecast i8* %28 to i8 addrspace(1)*
tail call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* align 1 %29, i8* align 1 %s, i32 %1, i1 false)
%30 = tail call i32 @printf_execute(i8* %2, i32 %total_buffer_size) #2
ret void
}

; Function Attrs: argmemonly nofree nosync nounwind willreturn
declare void @llvm.memcpy.p1i8.p4i8.i64(i8 addrspace(1)* noalias nocapture writeonly, i8 addrspace(4)* noalias nocapture readonly, i64, i1 immarg) #1

attributes #0 = { alwaysinline norecurse nounwind "frame-pointer"="none" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="gfx906" "target-features"="+16-bit-insts,+ci-insts,+dl-insts,+dot1-insts,+dot2-insts,+dot7-insts,+dpp,+flat-address-space,+gfx8-insts,+gfx9-insts,+s-memrealtime,+s-memtime-inst" }
attributes #1 = { argmemonly nofree nosync nounwind willreturn }
attributes #2 = { nounwind }

!llvm.module.flags = !{!0, !1}
!opencl.ocl.version = !{!2}
!llvm.ident = !{!3}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{i32 7, !"PIC Level", i32 2}
!2 = !{i32 2, i32 0}
!3 = !{!"AOMP_STANDALONE_12.0-0 clang version 13.0.0 (ssh://[email protected]:29418/lightning/ec/llvm-project 25f685d3c7ea9fccd2872f39d7603736a103575e)"}
!4 = !{!5, !5, i64 0}
!5 = !{!"int", !6, i64 0}
!6 = !{!"omnipotent char", !7, i64 0}
!7 = !{!"Simple C/C++ TBAA"}
!8 = !{!9, !9, i64 0}
!9 = !{!"long", !6, i64 0}
!10 = !{!11, !11, i64 0}
!11 = !{!"float", !6, i64 0}
!12 = !{!13, !13, i64 0}
!13 = !{!"double", !6, i64 0}
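Review note: some of the integer constants in cprint.ll appear to be the printf format strings packed as little-endian bytes, and the value added to the `__strlen_max` result appears to be the header size (the first word stored in each buffer) plus the format string length including its terminator. This is an inference from reading the IR, not something the PR states. The helper names below are hypothetical; the sketch only checks the arithmetic.

```c
#include <stdint.h>
#include <stddef.h>
#include <string.h>

/* Write the low `n` bytes of `v` into `out`, least significant byte first.
   This matches the little-endian target layout regardless of host order. */
static void le_bytes(uint64_t v, size_t n, char *out) {
    for (size_t k = 0; k < n; ++k)
        out[k] = (char)((v >> (8 * k)) & 0xff);
}

/* Returns 1 if `v`, read as `n` little-endian bytes, is exactly the
   NUL-terminated string `fmt` (terminator included), else 0. */
int const_matches_format(uint64_t v, size_t n, const char *fmt) {
    char bytes[8];
    if (n > sizeof bytes || strlen(fmt) + 1 != n)
        return 0;
    le_bytes(v, n, bytes);
    return memcmp(bytes, fmt, n) == 0;
}

/* Fixed part of total_buffer_size: header bytes plus format bytes
   (including the NUL). The argument string length is added at run time. */
uint32_t fixed_size(uint32_t header_bytes, const char *fmt) {
    return header_bytes + (uint32_t)strlen(fmt) + 1;
}
```

Under that reading, `store i32 684837` in `@f90print_` writes `"%s\n"`, `store i64 2925165409235749` in `@f90printl_` writes `"%s %ld\n"`, and the offsets 28, 39, 48 and 47 correspond to headers of 24, 32, 40 and 40 bytes followed by 4, 7, 8 and 7 format bytes.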
41 changes: 26 additions & 15 deletions utils/CMakeLists.txt
@@ -154,19 +154,30 @@ add_custom_command( OUTPUT modulefile.phantom

add_custom_target(utils_scripts ALL DEPENDS gpurun.phantom cloc.sh.phantom raja_build.sh.phantom kokkos_build.sh.phantom aompcc.phantom mymcpu.phantom aompversion.phantom clang-ocl.phantom modulefile.phantom aompExtractRegion.phantom)

-install(PROGRAMS
-  ${CMAKE_CURRENT_BINARY_DIR}/cloc.sh
-  ${CMAKE_CURRENT_BINARY_DIR}/gpurun
-  ${CMAKE_CURRENT_BINARY_DIR}/raja_build.sh
-  ${CMAKE_CURRENT_BINARY_DIR}/kokkos_build.sh
-  ${CMAKE_CURRENT_BINARY_DIR}/aompcc
-  ${CMAKE_CURRENT_BINARY_DIR}/aompExtractRegion
-  ${CMAKE_CURRENT_BINARY_DIR}/mymcpu
-  ${CMAKE_CURRENT_BINARY_DIR}/aompversion
-  ${CMAKE_CURRENT_SOURCE_DIR}/bin/mygpu
-  ${CMAKE_CURRENT_SOURCE_DIR}/bin/openmpi_set_cu_mask
-  ${CMAKE_CURRENT_SOURCE_DIR}/bin/blt.patch
-  ${CMAKE_CURRENT_SOURCE_DIR}/bin/raja.patch
-  ${CMAKE_CURRENT_BINARY_DIR}/clang-ocl
-  ${CMAKE_CURRENT_BINARY_DIR}/modulefile
+if(${AOMP_STANDALONE_BUILD})
+install(PROGRAMS
+  ${CMAKE_CURRENT_BINARY_DIR}/cloc.sh
+  ${CMAKE_CURRENT_BINARY_DIR}/gpurun
+  ${CMAKE_CURRENT_BINARY_DIR}/raja_build.sh
+  ${CMAKE_CURRENT_BINARY_DIR}/kokkos_build.sh
+  ${CMAKE_CURRENT_BINARY_DIR}/aompcc
+  ${CMAKE_CURRENT_BINARY_DIR}/aompExtractRegion
+  ${CMAKE_CURRENT_BINARY_DIR}/mymcpu
+  ${CMAKE_CURRENT_BINARY_DIR}/aompversion
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/mygpu
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/openmpi_set_cu_mask
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/blt.patch
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/raja.patch
+  ${CMAKE_CURRENT_BINARY_DIR}/clang-ocl
+  ${CMAKE_CURRENT_BINARY_DIR}/modulefile
   DESTINATION "${CMAKE_INSTALL_BINDIR}")
+
+else()
+install(PROGRAMS
+  ${CMAKE_CURRENT_BINARY_DIR}/mymcpu
+  ${CMAKE_CURRENT_BINARY_DIR}/aompcc
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/mygpu
+  ${CMAKE_CURRENT_SOURCE_DIR}/bin/openmpi_set_cu_mask
+  ${CMAKE_CURRENT_BINARY_DIR}/gpurun
+  DESTINATION "${DEVEL_PACKAGE}${CMAKE_INSTALL_BINDIR}")
+endif()