ENH: unlock the gil during GDAL functions that can take significant time #572
Conversation
Looks good to me!
with nogil:
    ogr_feature = OGR_L_GetNextFeature(ogr_layer)
Since we are in a loop here for counting the features, this means for a file with 10,000 rows, we are going to release/acquire the GIL 10,000 times?
(if I am not misreading the code, that might not be optimal)
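For context, a minimal Cython sketch of the pattern being discussed (the extern declarations are simplified stand-ins, not pyogrio's actual bindings):

cdef extern from "ogr_api.h":
    ctypedef void* OGRLayerH
    ctypedef void* OGRFeatureH
    OGRFeatureH OGR_L_GetNextFeature(OGRLayerH layer) nogil
    void OGR_F_Destroy(OGRFeatureH feature) nogil

cdef int count_features(OGRLayerH ogr_layer):
    cdef int count = 0
    cdef OGRFeatureH ogr_feature
    while True:
        with nogil:
            # the GIL is released here and re-acquired at the end of the
            # block, i.e. once per feature: 10,000 times for 10,000 rows
            ogr_feature = OGR_L_GetNextFeature(ogr_layer)
        if ogr_feature == NULL:
            break
        OGR_F_Destroy(ogr_feature)
        count += 1
    return count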
We could probably move the with nogil up, though, and put the entire loop inside it. We would just need to acquire the GIL again with "with gil:" when raising the errors, roughly as sketched below.
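A sketch of that variant, reusing the declarations from the sketch above (and assuming, purely for illustration, that errors can be detected via GDAL's CPL error state rather than through pyogrio's actual check_pointer helper):

cdef extern from "cpl_error.h":
    ctypedef enum CPLErr:
        CE_None
        CE_Failure
    CPLErr CPLGetLastErrorType() nogil
    const char* CPLGetLastErrorMsg() nogil

cdef int count_features_nogil(OGRLayerH ogr_layer) except -1:
    cdef int count = 0
    cdef OGRFeatureH ogr_feature
    with nogil:
        # the GIL is released once for the entire loop
        while True:
            ogr_feature = OGR_L_GetNextFeature(ogr_layer)
            if ogr_feature == NULL:
                if CPLGetLastErrorType() == CE_Failure:
                    with gil:
                        # re-acquire the GIL only to raise
                        raise RuntimeError(CPLGetLastErrorMsg().decode("utf-8"))
                break
            OGR_F_Destroy(ogr_feature)
            count += 1
    return count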
@jorisvandenbossche
I reran my performance tests with and without with nogil around functions like OGR_L_GetNextFeature in the high-iteration feature-reading loops, using the NZ building outlines file.
Conclusion: it doesn't seem to make a significant difference either way:
- When reading the entire New Zealand building outlines file (3.3 million features) there is no difference whether the nogil is there or not, even though it is passed 3.3 million times.
- Also when running in parallel threads, the differences are within the fluctuations I see when running the script multiple times.
It seems the cost of having it there is not significant, at least in the tests I did. There might be file formats where having the nogil there is beneficial, but that is pure speculation. It is also possible that nogil behaves a bit differently on a loaded server than on a Windows desktop, but that is speculation as well.
Hence, I don't have a preference for keeping it or not.
Remark: moving it up so the entire loop is nogil'ed takes some effort, as check_pointer returns a Python object (an exception), which isn't trivial to change (see the sketch below)... and as there doesn't seem to be a performance impact anyway, I don't think it is worth the trouble.
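To illustrate the issue, a simplified stand-in for such a helper (not check_pointer's actual definition) shows why it needs the GIL:

cdef void* check_pointer_sketch(void* ptr) except NULL:
    # simplified stand-in: constructing and raising a Python exception
    # object requires holding the GIL, so a helper like this cannot be
    # called from inside a `with nogil` block as-is
    if ptr == NULL:
        raise RuntimeError("NULL pointer returned by GDAL")
    return ptr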
With nogil (also) in the feature-reading loops (e.g. around OGR_L_GetNextFeature)
************ with use_arrow=True *******************
len(df)=15, with sql, use_arrow=True took 2.453807899990352
len(df)=15, with where, use_arrow=True took 2.420489699987229
read, lens=[8, 2, 2, 3] sequential, sql, use_arrow=True, took=2.5079255000164267
read, lens=[8, 2, 2, 3] sequential, where, use_arrow=True, 2.5423938000167254
read, lens=[8, 2, 2, 3], parallel, sql, use_arrow=True, took=1.4490034999907948
read, lens=[8, 2, 2, 3], parallel, where, use_arrow=True, took=1.2845507999882102
len(df)=3289194, no filter, use_arrow=True took 7.136394500004826
************ with use_arrow=False *******************
len(df)=15, with sql, use_arrow=False took 5.860275599989109
len(df)=15, with where, use_arrow=False took 7.902896300016437
read, lens=[8, 2, 2, 3] sequential, sql, use_arrow=False, took=6.309563500020886
read, lens=[8, 2, 2, 3] sequential, where, use_arrow=False, 6.09413590002805
read, lens=[8, 2, 2, 3], parallel, sql, use_arrow=False, took=2.110850199998822
read, lens=[8, 2, 2, 3], parallel, where, use_arrow=False, took=3.5822784999909345
len(df)=3289194, no filter, use_arrow=False took 50.74965380001231
Without nogil in the feature-reading loops
************ with use_arrow=True *******************
len(df)=15, with sql, use_arrow=True took 2.415697899996303
len(df)=15, with where, use_arrow=True took 2.8347185000020545
read, lens=[8, 2, 2, 3] sequential, sql, use_arrow=True, took=2.520717900013551
read, lens=[8, 2, 2, 3] sequential, where, use_arrow=True, 2.9033705000183545
read, lens=[8, 2, 2, 3], parallel, sql, use_arrow=True, took=1.8013735999993514
read, lens=[8, 2, 2, 3], parallel, where, use_arrow=True, took=1.212515799998073
len(df)=3289194, no filter, use_arrow=True took 7.0017002999957185
************ with use_arrow=False *******************
len(df)=15, with sql, use_arrow=False took 5.824918400001479
len(df)=15, with where, use_arrow=False took 7.959993800002849
read, lens=[8, 2, 2, 3] sequential, sql, use_arrow=False, took=6.387824200006435
read, lens=[8, 2, 2, 3] sequential, where, use_arrow=False, 6.192911899997853
read, lens=[8, 2, 2, 3], parallel, sql, use_arrow=False, took=2.3500404999940656
read, lens=[8, 2, 2, 3], parallel, where, use_arrow=False, took=3.5810892000154126
len(df)=3289194, no filter, use_arrow=False took 51.31016650001402
Script used:
"""Performance testing for multithreaded reading."""
import time
from concurrent import futures
from time import perf_counter
import pyogrio
if __name__ == "__main__":
path = "C:/temp/lds-nz-building-outlines/nz-building-outlines.gpkg"
# path = r"C:\Temp\prc2023\prc2023.gpkg"
nb_workers = 4
layer = pyogrio.list_layers(path)[0][0]
info = pyogrio.read_info(path, layer=layer, force_feature_count=True)
featurecount = info["features"]
where = "{filter} ST_NPOINTS(st_buffer(geom, 10)) > 2000"
# Set filter so each chunk in the parallel read returns some features, to avoid a
# bug in gdal with arrow where empty results are a lot slower.
where = "{filter} ST_NPOINTS(geom) > 500"
sql_template = f"""
SELECT * from "{layer}"
WHERE {where}
"""
for use_arrow in [True, False]:
print(f"************ with {use_arrow=} *******************")
start = perf_counter()
df = pyogrio.read_dataframe(
path, sql=sql_template.format(filter=""), use_arrow=use_arrow
)
print(f"{len(df)=}, with sql, {use_arrow=} took {perf_counter() - start}")
start = perf_counter()
df = pyogrio.read_dataframe(
path, where=where.format(filter=""), use_arrow=use_arrow
)
print(f"{len(df)=}, with where, {use_arrow=} took {perf_counter() - start}")
time.sleep(5) # Give cpu's a break
# Test multithreading performance
# -------------------------------
# Divide featurecount in nb_workers ranges
ranges = []
for i in range(nb_workers):
start_fc = int(i * featurecount / nb_workers)
end_fc = int((i + 1) * featurecount / nb_workers)
ranges.append(f"fid >= {start_fc} AND fid < {end_fc} AND ")
# Test reading sequentially with sql
start = perf_counter()
lens = []
for filter in ranges:
sql = sql_template.format(filter=filter)
df = pyogrio.read_dataframe(path, sql=sql, use_arrow=use_arrow)
lens.append(len(df))
took = perf_counter() - start
print(f"read, {lens=} sequential, sql, {use_arrow=}, {took=}")
# Test reading sequentially with where
start = perf_counter()
lens = []
for filter in ranges:
df = pyogrio.read_dataframe(
path, where=where.format(filter=filter), use_arrow=use_arrow
)
lens.append(len(df))
print(
f"read, {lens=} sequential, where, {use_arrow=}, {perf_counter() - start}"
)
time.sleep(5) # Give cpu's a break
# Test reading in parallel with sql
start = perf_counter()
with futures.ThreadPoolExecutor(max_workers=nb_workers) as executor:
results = list(
executor.map(
lambda f, ua: pyogrio.read_dataframe(
path, sql=sql_template.format(filter=f), use_arrow=ua
),
ranges,
[use_arrow] * nb_workers,
)
)
lens = [len(df) for df in results]
took = perf_counter() - start
print(f"read, {lens=}, parallel, sql, {use_arrow=}, {took=}")
# Test reading in parallel with where
start = perf_counter()
with futures.ThreadPoolExecutor(max_workers=nb_workers) as executor:
# print(f"where: {where.format(filter=ranges[0])}")
results = list(
executor.map(
lambda f, ua: pyogrio.read_dataframe(
path, where=where.format(filter=f), use_arrow=ua
),
ranges,
[use_arrow] * nb_workers,
)
)
lens = [len(df) for df in results]
took = perf_counter() - start
print(f"read, {lens=}, parallel, where, {use_arrow=}, {took=}")
# Finally, read without any filter
# --------------------------------
start = perf_counter()
df = pyogrio.read_dataframe(path, use_arrow=use_arrow)
print(f"{len(df)=}, no filter, {use_arrow=} took {perf_counter() - start}")| with nogil: | ||
| ogr_feature = OGR_L_GetNextFeature(ogr_layer) |
Similar comment here, although in this case we can't really move it up (and a few similar cases below as well)
Some GDAL functions (e.g. executing SQL statements) can take a while, so nogil-ling those can already give significant performance improvements when using multithreading.
Remark: determining exactly which GDAL functions take time is not trivial, as it can depend on the file format. For formats with a strict data schema (e.g. shapefile) determining column types is fast, while for some other formats (e.g. GeoJSON) the entire file needs to be read first to determine all data types. Hence, a relatively large subset of GDAL calls has been nogil'ed to avoid missing significant situations.
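For illustration, the general pattern applied here looks roughly like the sketch below. GDALDatasetExecuteSQL is GDAL's C API, but the declaration is simplified and the actual pyogrio call sites differ:

cdef extern from "gdal.h":
    ctypedef void* GDALDatasetH
    ctypedef void* OGRLayerH
    ctypedef void* OGRGeometryH
    OGRLayerH GDALDatasetExecuteSQL(
        GDALDatasetH ds, const char* statement,
        OGRGeometryH spatial_filter, const char* dialect
    ) nogil

cdef OGRLayerH execute_sql(GDALDatasetH ogr_dataset, bytes sql):
    cdef const char* sql_c = sql
    cdef OGRLayerH result
    with nogil:
        # executing SQL can scan large parts of the file; releasing the GIL
        # here lets other Python threads run in the meantime
        result = GDALDatasetExecuteSQL(ogr_dataset, sql_c, NULL, NULL)
    return result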