We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi,
Another inaccuracy issue in XML extraction for tables.
If the table contains one or more empty cells, the XML simply ignore it. For example, it makes a table with 3 row into 2 row.
<table> <row span="3"> <cell>a</cell> <cell>b</cell> </row> <row span="3"> <cell>f</cell> <cell>s</cell> <cell>s</cell> </row> <row> <cell>g</cell> <cell>b</cell> </row> </table>
It's better to extract empty cells as empty string or None to keep the layout correct.
<table> <row span="3"> <cell>a</cell> <cell></cell> <cell>b</cell> </row> <row span="3"> <cell>f</cell> <cell>s</cell> <cell>s</cell> </row> <row> <cell>g</cell> <cell>b</cell> <cell>None</cell> </row> </table>
The text was updated successfully, but these errors were encountered:
It's not a bug in itself be I agree things could be improved, do you want to work on a PR?
Sorry, something went wrong.
I wish I could but my little, self-taught knowledge of Python and GitHub does not allow me to get my hands on PRs. 😞
@naktinis You wrote code targeting tables, maybe you are also interested.
No branches or pull requests
Hi,
Another inaccuracy issue in XML extraction for tables.
If the table contains one or more empty cells, the XML simply ignore it. For example, it makes a table with 3 row into 2 row.
It's better to extract empty cells as empty string or None to keep the layout correct.
The text was updated successfully, but these errors were encountered: