This repository was archived by the owner on Dec 18, 2024. It is now read-only.

In future trim down custom metrics from payload? #269

Closed
HTTPArchive/dataform
#8
@tunetheweb

Description

@tunetheweb

As part of the effort to reduce the payload size by deduplicating data, we remove the _custom field (amongst others) when saving this data to all tables:

https://github.com/HTTPArchive/wptagent/blob/e4546673d3b658022afb3885885e696290da53c5/HTTPArchive/httparchive.py#L425-L427

    # Remove the fields that are parsed out into separate columns
    page.pop("_parsed_css", None)
    page.pop("_custom", None)
    page.pop("_lighthouse", None)
    ...

However, _custom is only a list of the custom metric names:

"_custom": [
        "00_reset",
        "Colordepth",
        "Dpi",
...
        "usertiming",
        "valid-head",
        "well-known",
        "wpt_bodies"
    ],
    "_00_reset": null,
    "_Colordepth": 24,
    ...etc

The weightier parts are the actual custom metric values beneath it (_00_reset, _Colordepth, etc.), some of which are quite large.

So we should extend this step to remove those too, which would save a lot of weight.

However, for now it is useful to keep them for the legacy tables (since there is no legacy custom metrics table), so leave them in place. Filing this issue so we don't forget once we move off legacy.
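
For when that time comes, a minimal sketch of what the extra cleanup might look like. This assumes every name listed in _custom maps to a top-level key with a leading underscore, as in the excerpt above. The helper name strip_custom_metrics and the sample page dict are hypothetical and are not part of httparchive.py.

```python
def strip_custom_metrics(page):
    """Remove the `_custom` name list and each `_<name>` value, in place.

    Hypothetical helper: assumes each entry in `_custom` corresponds to a
    top-level key named "_" + entry.
    """
    # Pop the list of names first; fall back to an empty list if absent or null.
    names = page.pop("_custom", None) or []
    for name in names:
        # Drop the per-metric value; ignore names with no matching key.
        page.pop("_" + name, None)
    return page


# Illustrative data only, shaped like the excerpt in this issue.
page = {
    "_custom": ["00_reset", "Colordepth"],
    "_00_reset": None,
    "_Colordepth": 24,
    "url": "https://example.com/",
}
strip_custom_metrics(page)
```

Reading the names from _custom itself avoids hard-coding the metric list, but it only works if the per-metric keys are removed before, or together with, the _custom list.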
