Pastelyzer 0.10-beta
Pre-release-
discard
filter action can be used to get rid of unwanted artefacts. -
All non-discarded artefacts are stored in database.
-
Artefacts now have
important
flag which can be set withset-important
filter action. Artefacts marked important are highlighted in dashboard and
the flag is also stored in the database. -
Artefacts now have
note
slot which can be set withset-note
filter
action. This note is stored in the database along with the artefact. -
No more
IMPORTANT-BANK-CARD
class ("important" flag can be used with
BANK-CARD-NUMBER
artefact instead). -
Introduction of user sets — IPv4 networks, super-domains (multi-level) and
bank card bins. Multiple sets of the same or different kinds can be loaded
(previously with command-line options only one set of each kind could be
used). -
Important: the following command line options have been made redundant
with the introduction of user sets:--important-cc-bins
,
--interesting-tlds
,--networks-file
and--tlds-file
. Running
pastelyzer
in server mode with any of these options provided will show you
sample configuration fragments you should integrate into your configuration
instead. The relevant documentation is section "3.2 User sets" (and also
section "3.6 Example configuration"). -
Important: the default value of
--resolve-domains
option is no longer
guessed (since--networks-file
was removed). This means that domain
resolution must be turned on explicitly. -
Comparison filter functions (
<
,=
and>
) can also be used with strings. -
mixed-case?
filter predicate. -
Additional extractors:
note
source-string
bytes
context-before
context-after
source-context
-
Discoveries are logged with
stat
instead ofhit
. -
Added an ability to reprocess previously processed documents. Useful when
upgrading to a new version ofpastelyzer
that changes (hopefully improves)
the way documents are processed (like this version). The documentation
mentions this, but so that users don't miss it:Important: if
pastelyzer
configuration contains rules to send emails or
interact with other systems/programs then the reprocessing might trigger
these rules again. It is therefore advisable to create a separate
configuration file for reprocessing and either omit these rules or change
them so that the information does not go to production systems. -
An experimental API to query the database has been added (for cases when
usingcurl
andjq
is preferable to SQL).