richardleach
Contributor

@richardleach richardleach commented Apr 15, 2025

Perl_sv_setsv_flags is the heavyweight function for assigning the value(s) of
a source SV to a destination SV. It contains many branches for preparing the
destination SV prior to assignment. However:

  • If the destination SV has just been created, much of that logic isn't needed.
  • When cloning a SV, simple assignments (particularly IVs and PVs) dominate.

This set of commits:

  • Extracts the "is this CoWable?" test from Perl_sv_setsv_flags into a macro.
  • Adds Perl_sv_freshcopy_flags and two static helper functions.
  • Modifies Perl_newSVsv_flags and Perl_sv_mortalcopy_flags to use them.
  • Standardizes a number of call sites that did their own things but really
    should use Perl_newSVsv_flags or Perl_sv_mortalcopy_flags.

Using perl's test harness as a guide:

  • Bodyless code handles 45% of calls to Perl_newSVsv_flags and
    57% of calls to Perl_sv_mortalcopy_flags.
  • The SVt_PV/SVp_POK code handles 32% of calls to
    Perl_newSVsv_flags and 36% of calls to Perl_sv_mortalcopy_flags.
  • S_sv_freshcopy_flags code handles 95% of the remainder in
    Perl_newSVsv_flags and 91% of the remainder in Perl_sv_mortalcopy_flags.

With these changes compared with a build of blead:

  • perl -e 'for (1..100_000) { my $x = [ (1) x 1000 ]; }' runs 10% faster

  • perl -e 'for (1..100_000) { my $x = [ ("Perl") x 250 ]; }' runs 45% faster


  • This set of changes does require a perldelta entry and has one.

@richardleach richardleach added the defer-next-dev label ("This PR should not be merged yet, but await the next development cycle") Apr 15, 2025
@Leont
Contributor

Leont commented Apr 29, 2025

"Cloning" is a rather unfortunate choice of words, given that it has a very specific meaning in our codebase that is quite different from what this PR is about. Renaming the PR may be helpful.

@richardleach richardleach changed the title from "Dedicated SV cloning code in place of Perl_sv_setsv_flags" to "Dedicated SV copying code in place of Perl_sv_setsv_flags" Apr 29, 2025
@richardleach richardleach force-pushed the S_sv_freshcopy_flags branch 2 times, most recently from c392526 to 91c2b99 Compare May 8, 2025 16:54
@richardleach
Contributor Author

I've made a lot of changes following earlier comments - thanks for those - and have finally force-pushed.

These changes aren't complete. For example:

  • Measured performance seems worse than in the original PR version, so I need to look into that.
  • I might change the sflag handling / SvFLAGS(dsv) setting.
  • I haven't settled on struct member initialisation.
  • I might still rename the function that is currently Perl_newSVsv_flags and have newSVsv_flags be a macro that checks (ssv) before calling the sv.c function.

@tonycoz
Contributor

tonycoz commented Aug 13, 2025

the last commit's message indicates it should be squashed

@richardleach richardleach force-pushed the S_sv_freshcopy_flags branch 2 times, most recently from 131563e to 1197bc1 Compare August 13, 2025 19:51
@richardleach
Contributor Author

Thanks, I've hopefully addressed those comments and squashed all trailing commits.

@richardleach richardleach force-pushed the S_sv_freshcopy_flags branch 2 times, most recently from 27ffa67 to dda1c95 Compare August 13, 2025 23:10
Perl_newSVsv_flags_NN creates a fresh SV that contains the values of its
source SV argument. It's like calling `new_SV(dsv)` followed by
`sv_setsv_flags(dsv, ssv, flags)`, but is optimized for a brand new
destination SV and the most common code paths.

The intended initial users for this new function were:
* Perl_sv_mortalcopy_flags (still in sv.c)
* Perl_newSVsv_flags (now a simple function in sv_inline.h)

Perl_newSVsv_flags_NN prioritises the following hot cases:
* SVt_IV containing an IV
* SVt_IV containing an RV
* SVt_NV containing an NV
* SVt_PV containing a PV

It will then check for:
* SVt_NULL
* SVt_IV containing a UV
* SVt_LAST

The helper function S_newSVsv_flags_NN_PVxx is called for everything else.
It will use Perl_sv_setsv_flags as a fallback for rare or tricky cases.

S_newSVsv_flags_NN_POK is a dedicated helper for string swipe/COW/copy
logic and is called from both Perl_newSVsv_flags_NN and
S_newSVsv_flags_NN_PVxx.
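
The dispatch order described above can be sketched with a toy model. This is
only an illustration under stated assumptions: `toy_sv`, `toy_type`,
`toy_setsv` and `toy_newsv_copy_nn` are hypothetical names, not perl's API,
and the model ignores flags, magic, COW and string swiping. It shows the
general shape only: handle the common types inline on a destination known to
be brand new, and defer everything else to a general-purpose assignment
routine.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, heavily simplified value type; real SVs are far richer. */
typedef enum { TOY_NULL, TOY_IV, TOY_NV, TOY_PV, TOY_OTHER } toy_type;

typedef struct toy_sv {
    toy_type type;
    long     iv;  /* meaningful when type == TOY_IV */
    double   nv;  /* meaningful when type == TOY_NV */
    char    *pv;  /* owned NUL-terminated string, or NULL */
} toy_sv;

/* Duplicate a string, aborting on allocation failure. */
static char *toy_strdup(const char *s)
{
    size_t len = strlen(s) + 1;
    char *copy = malloc(len);
    if (!copy) abort();
    memcpy(copy, s, len);
    return copy;
}

/* Stand-in for the general-purpose assignment path: it copes with any
 * type and must release whatever the destination already holds. */
static void toy_setsv(toy_sv *dst, const toy_sv *src)
{
    char *fresh = src->pv ? toy_strdup(src->pv) : NULL;
    free(dst->pv);
    *dst = *src;
    dst->pv = fresh;
}

/* Fresh-copy path: the destination is new, so there is nothing to
 * release and the common types can be handled with plain stores. */
toy_sv *toy_newsv_copy_nn(const toy_sv *src)
{
    assert(src != NULL);
    toy_sv *dst = malloc(sizeof *dst);
    if (!dst) abort();
    dst->type = TOY_NULL;
    dst->iv = 0;
    dst->nv = 0.0;
    dst->pv = NULL;

    switch (src->type) {
    case TOY_IV:   dst->type = TOY_IV; dst->iv = src->iv; break;
    case TOY_NV:   dst->type = TOY_NV; dst->nv = src->nv; break;
    case TOY_PV:   dst->type = TOY_PV;
                   dst->pv = src->pv ? toy_strdup(src->pv) : NULL;
                   break;
    case TOY_NULL: break;                       /* already an empty value */
    default:       toy_setsv(dst, src); break;  /* rare cases: slow path */
    }
    return dst;
}
```

In this sketch the caller owns the returned value and is responsible for
freeing both the struct and its string.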

With these changes compared with the previous commit:

* `perl -e 'for (1..100_000_0) { my $x = { (1) x 1000 }; }'` runs about 20% faster

* `perl -e 'for (1..100_000_0) { my $x = { ("Perl") x 250 }; }'` runs about 40% faster

* `perl -e 'for (1..100_000_0) { my $x = { a => 1, b => 2, c => 3, d => 4, e => 5 }; }'`
   is a touch faster, but within the margin of error

* `perl -e 'for (1..100_000_0) { my $x = { a => "Perl", b => "Perl", c => "Perl", d => "Perl", e => "Perl" } ; }'`
   runs about 17% faster

Perl_newSVsv_flags has become just a stub around Perl_newSVsv_flags_NN.
For callers where the source SV* is NULL, not having to call a
function in sv.c to immediately return is very desirable.
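
The stub-plus-worker split can be sketched as follows. This is a minimal
illustration, not the code from this PR: `mini_sv`, `mini_copy_nn` and
`mini_copy` are hypothetical names, and returning NULL for a NULL source is
an assumption of the sketch. The point is that the NULL test lives in a small
inline function, so a caller passing NULL never pays for an out-of-line call.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for an SV; only one field is modelled. */
typedef struct mini_sv { long iv; } mini_sv;

/* Out-of-line worker: callers guarantee that src is non-NULL. */
mini_sv *mini_copy_nn(const mini_sv *src)
{
    assert(src != NULL);
    mini_sv *dst = malloc(sizeof *dst);
    if (!dst) abort();
    dst->iv = src->iv;
    return dst;
}

/* Inline stub: the NULL check happens at the call site, and only
 * non-NULL sources reach the worker. */
static inline mini_sv *mini_copy(const mini_sv *src)
{
    if (!src)
        return NULL;
    return mini_copy_nn(src);
}
```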

Besides using the just-introduced faster path for SV copying, this
allows the check for SV_GMAGIC to be pushed into the called function
without having to worry about SV leaks.

Two additional micro-optimizations are also in this commit:
* A pointer to xav_fill is cached for use in the loop. This can
  be used directly to update AvFILLp(av), rather than having to
  get there from av's SV* each time.

* The value of the loop iterator, i, is directly written into
  xav_fill, rather than getting the value in that slot,
  incrementing it (to get the same value as i), and writing it back.
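
Both micro-optimizations can be illustrated with a toy array type. The names
here (`toy_body`, `toy_av`, `toy_fill_baseline`, `toy_fill_cached`) are
hypothetical and the layout is only loosely modelled on an AV; the sketch
assumes the array is empty on entry, so the fill index always equals the loop
counter.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical array body: fill is the index of the last used slot,
 * or -1 when the array is empty. */
typedef struct toy_body { ptrdiff_t fill; } toy_body;
typedef struct toy_av   { toy_body *body; long *slots; } toy_av;

/* Baseline: reach fill through av on every iteration, then
 * read, increment and write it back. */
void toy_fill_baseline(toy_av *av, const long *src, ptrdiff_t n)
{
    assert(av->body->fill == -1);
    for (ptrdiff_t i = 0; i < n; i++) {
        av->slots[i] = src[i];
        av->body->fill++;
    }
}

/* Optimized: cache the address of fill once, then store the loop
 * counter directly; no load or increment of the old value is needed. */
void toy_fill_cached(toy_av *av, const long *src, ptrdiff_t n)
{
    assert(av->body->fill == -1);
    ptrdiff_t *fillp = &av->body->fill;
    for (ptrdiff_t i = 0; i < n; i++) {
        av->slots[i] = src[i];
        *fillp = i;
    }
}
```

Both functions leave the array in the same state; whether a compiler already
performs this transformation on its own depends on what it can prove about
aliasing, which is an open question for any particular build.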
@richardleach richardleach merged commit 945b008 into Perl:blead Aug 23, 2025
33 checks passed
@richardleach richardleach deleted the S_sv_freshcopy_flags branch August 23, 2025 16:44
@richardleach
Contributor Author

Many thanks for all the reviews @tonycoz & @bulk88 - it's a relief to finally get this in before the github-actions bot adds the hasConflicts flag again!
