Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upReading Excel HTML files will misalign cells in some cases #1621
Labels
Comments
Thanks for reporting @KurtMar , indeed that line should be the same as the previous check. We'd accept a PR, please change the line in Simple repro: <table>
<tr>
<td>abc</td>
<td><b> </b></td>
<td>def</td>
</tr>
<table> |
KurtMar
added a commit
to KurtMar/sheetjs
that referenced
this issue
Nov 20, 2019
…cells in some cases
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I noticed a bug in the html_to_sheet function. When reading cells that are empty, the cell index is correctly incremented, but if the cell has HTML tags and is observed to be empty after stripping the tags, the cell index is not incremented. This results in misalignment of the subsequent cells:
https://github.com/SheetJS/js-xlsx/blob/e3c5eac99c3b2be6929adfca455c1be87fab792b/xlsx.js#L18956-L18960