#189 Zero byte read from NFS - Fix method file_read in watched_file.rb #190
Conversation
…, method file_read
@sv-gh while this approach may work for your specific setup, it's not an approach that I believe scales to the wide user-base of this plugin. I appreciate the work you put in, and will work to get an acceptable solution put together shortly.
The primary reason is that the presence of a null-byte isn't necessarily a reliable indicator that the sysread has returned bytes beyond the end of the file:
- a null byte is valid in gzipped log files, which means that when reading gzipped logs in this implementation we will frequently need to fall through all attempts, reading, sleeping, and un-seeking repeatedly before finally emitting each chunk.
- When reading beyond the end of a file using `sysread`, the behaviour is undefined; on your system it may be observed to emit one or more null bytes, but this is not specified nor guaranteed.
In order to reliably ensure that we don't read beyond the end of our source files, each time we sysread we will need to first stat the file to determine how much is available.
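As a rough sketch of that idea (hypothetical code, not the plugin's actual implementation; `bounded_read` and its parameters are invented names), stat-ing before each `sysread` could look like:

```ruby
# Hypothetical sketch: cap each sysread at the bytes known to exist,
# by stat-ing the file before reading. Not the plugin's actual code.
def bounded_read(file, requested)
  offset = file.sysseek(0, IO::SEEK_CUR)  # current position, unbuffered
  available = file.stat.size - offset     # bytes between cursor and EOF
  return nil if available <= 0            # nothing to read yet
  file.sysread([requested, available].min)
end
```

Because the read length is clamped to what `stat` reports, the call can never run past the end of the file, regardless of how the underlying filesystem behaves on over-reads.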
There is currently work in-flight to improve support for file rotation, which touches this and many other methods, and a follow-up is already planned by @guyboertje to address the root cause of #189 to ensure that we don't sysread beyond the end of a file.
I'll work with him to ensure that the right solution gets implemented in the coming weeks.
    set_accessed_at
    buf = @file.sysread(amount)
    # return if no zero byte inside
    return buf unless zc = buf.index("\0")
The `zc` variable is bound, but unused.
Additionally, for readability we prefer that variables are descriptive and unambiguous, and while I can tell what `buf` is, variables like `cc` and `dt` would be better off with more descriptive names like `attempts_remaining` and `backoff_delay`.
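For illustration only, a standalone rendition of the PR's retry loop with the suggested names might read as follows (`io` stands in for the plugin's `@file` handle; this is not the plugin's actual code):

```ruby
# Illustrative rendition of the PR's retry loop with descriptive names.
# `io` stands in for the plugin's @file handle; hypothetical code.
def read_with_retries(io, amount)
  attempts_remaining = 3     # was `cc`
  backoff_delay = 1.0 / 128  # was `dt`: seconds to sleep between attempts
  loop do
    buf = io.sysread(amount)
    # no need to bind the index when it is only used as a truthiness test
    return buf unless buf.index("\0")
    attempts_remaining -= 1
    return buf if attempts_remaining <= 0
    io.sysseek(-buf.bytesize, IO::SEEK_CUR)  # un-seek and retry after a pause
    sleep(backoff_delay)
  end
end
```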
    # return if no zero byte inside
    return buf unless zc = buf.index("\0")
    # update amount to read
    amount = buf.bytesize
`amount` here has multiple meanings (amount requested vs. amount retrieved), and the mutation of the input parameter combined with the retry pattern means that on each subsequent attempt we will read potentially fewer bytes than initially requested. This is rather opaque and can present surprises later on that would be hard to debug, so we try to avoid overloading variables like this.
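One hedged way to avoid overloading the parameter (hypothetical names; not the plugin's code) is to track what an attempt actually retrieved in its own local:

```ruby
# Hypothetical sketch: record what an attempt retrieved in a separate
# local instead of reassigning the `requested_bytes` parameter.
def read_once(io, requested_bytes)
  buf = io.sysread(requested_bytes)
  retrieved_bytes = buf.bytesize  # `requested_bytes` is left untouched
  [buf, retrieved_bytes]
end
```

Each name then carries exactly one meaning for the life of the method, so a later retry can still see the caller's original request.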
Thanks for your reply.
Looking forward to the working final result of your changes.
Thanks again,
Sergey Volkov
…On Wed, Jun 27, 2018 at 4:44 PM Ry Biesemeyer ***@***.***> wrote:
This change has no impact on currently working code; it is meant to improve the reliability of log reading in cases where the size/content update in the file-write implementation is not atomic in the underlying FS software (as in NFSv3). Please approve/accept this fix for the nearest release.
This issue is blocking for our project and puts our Logstash use in jeopardy.