Clean XHTML code in odt2xhtml converter
When converting XHTML to ODT in xhtml2odt.py, we should remove certain tags together with their content, such as "meta" and "style" (at least as a first step). Here is an example of markup cut and pasted from Word into Kupu:
Point A29 = <meta http-equiv=
content=
content="Microsoft Word 11" /><link href="file:
&amp;amp;lt;!--
/* Font Definitions */
@font-face
{font-family:
panose-1:2 11 6 4 3 5 4 4 2 4;
mso-font-charset:0;
mso-generic-
mso-font-
mso-font-
@font-face
{font-family:
panose-1:2 4 5 3 5 4 6 3 2 4;
mso-font-charset:0;
mso-generic-
mso-font-
mso-font-
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-
margin:0cm;
margin-
mso-pagination:
font-size:10.0pt;
font-family:"Times New Roman";
mso-fareast-
h1
{mso-style-
margin-top:12.0pt;
margin-right:0cm;
margin-
margin-left:0cm;
mso-pagination:
page-break-
mso-outline-
font-size:16.0pt;
font-family:
mso-font-
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;
mso-header-
mso-footer-
mso-paper-
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-
mso-list-
mso-list-
@list l0:level1
{mso-level-
mso-level-
text-indent:
ol
{margin-
ul
{margin-
--&amp;amp;gt;
</style>
Point A28 - <meta http-equiv=
<ol><li><link href="file:
<style>
&amp;lt;!--
/* Font Definitions */
@font-face
{font-family:
panose-1:5 0 0 0 0 0 0 0 0 0;
mso-font-charset:2;
mso-generic-
mso-font-
mso-font-
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-
margin:0cm;
margin-
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;
mso-header-
mso-footer-
mso-paper-
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-
mso-list-
mso-list-
@list l0:level1
{mso-level-
mso-level-
mso-level-text:-;
mso-level-
mso-level-
text-indent:
font-family:"Times New Roman";
mso-fareast-
ol
{margin-
ul
{margin-
--&amp;gt;
</style> <meta http-equiv=
&amp;lt;!--
/* Font Definitions */
@font-face
{font-family:
panose-1:5 0 0 0 0 0 0 0 0 0;
mso-font-charset:2;
mso-generic-
mso-font-
mso-font-
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-
margin:0cm;
margin-
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p.Paragraphedel
{mso-style-
mso-style-
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-
mso-add-space:auto;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;
mso-header-
mso-footer-
mso-paper-
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-
mso-list-
mso-list-
@list l0:level1
{mso-level-
mso-level-
mso-level-text:-;
mso-level-
mso-level-
text-indent:
font-family:"Times New Roman";
mso-fareast-
ol
{margin-
ul
{margin-
--&amp;gt;
</style>
Point A27 - <meta http-equiv=
<link href="file:
<style>
&amp;lt;!--
/* Font Definitions */
@font-face
{font-family:
panose-1:5 0 0 0 0 0 0 0 0 0;
mso-font-charset:2;
mso-generic-
mso-font-
mso-font-
@font-face
{font-family:
panose-1:2 11 6 4 3 5 4 4 2 4;
mso-font-charset:0;
mso-generic-
mso-font-
mso-font-
@font-face
{font-family:
panose-1:2 11 6 4 3 5 4 4 2 4;
mso-font-charset:0;
mso-generic-
mso-font-
mso-font-
@font-face
{font-family:
panose-1:2 11 5 2 2 2 2 2 2 4;
mso-font-charset:0;
mso-generic-
mso-font-
mso-font-
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{mso-style-
margin:0cm;
margin-
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
p
{mso-margin-
margin-right:0cm;
mso-margin-
margin-left:0cm;
mso-pagination:
font-size:12.0pt;
font-family:"Times New Roman";
mso-fareast-
@page Section1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;
mso-header-
mso-footer-
mso-paper-
div.Section1
{page:Section1;}
/* List Definitions */
@list l0
{mso-list-
mso-list-
mso-list-
@list l0:level1
{mso-level-
mso-level-
mso-level-text:-;
mso-level-
mso-level-
text-indent:
font-family:
mso-fareast-
mso-bidi-
ol
{margin-
ul
{margin-
--&amp;gt;
</style>
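As an illustration of the removal described above, here is a minimal sketch built on Python's standard html.parser module. It is not the code used in xhtml2odt.py or in appy.shared; the names TagStripper and clean_xhtml, the two tag sets, and the sample string are all hypothetical and chosen only for this example. It drops "meta" and "link" tags, drops "style" together with everything inside it, and re-emits the rest unchanged.

```python
from html.parser import HTMLParser

# Hypothetical tag sets, chosen for this sketch only.
EMPTY_TAGS_TO_DROP = {"meta", "link"}   # tags without content: drop the tag
CONTAINER_TAGS_TO_DROP = {"style"}      # drop the tag and everything inside


class TagStripper(HTMLParser):
    """Re-emit the input markup, minus the unwanted tags (sketch)."""

    def __init__(self):
        # convert_charrefs=False keeps entities such as &amp; as written.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a dropped container

    def handle_starttag(self, tag, attrs):
        if tag in CONTAINER_TAGS_TO_DROP:
            self.skip_depth += 1
        elif self.skip_depth == 0 and tag not in EMPTY_TAGS_TO_DROP:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closed tags such as <meta ... /> arrive here.
        unwanted = EMPTY_TAGS_TO_DROP | CONTAINER_TAGS_TO_DROP
        if self.skip_depth == 0 and tag not in unwanted:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in CONTAINER_TAGS_TO_DROP:
            if self.skip_depth > 0:
                self.skip_depth -= 1
        elif self.skip_depth == 0 and tag not in EMPTY_TAGS_TO_DROP:
            self.out.append("</%s>" % tag)

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.out.append(data)

    def handle_entityref(self, name):
        if self.skip_depth == 0:
            self.out.append("&%s;" % name)

    def handle_charref(self, name):
        if self.skip_depth == 0:
            self.out.append("&#%s;" % name)

    # Comments, doctype declarations and processing instructions are not
    # re-emitted: the default handlers ignore them.


def clean_xhtml(markup):
    """Return markup without meta, link and style tags (sketch)."""
    parser = TagStripper()
    parser.feed(markup)
    parser.close()
    return "".join(parser.out)


if __name__ == "__main__":
    # Made-up sample, loosely modelled on the Word output shown above.
    sample = (
        '<meta http-equiv="Content-Type" />'
        '<style>p.MsoNormal {margin:0cm;}</style>'
        '<p class="MsoNormal">Point A29 &amp; more</p>'
    )
    print(clean_xhtml(sample))
```

One limitation of this approach: if a "style" tag is never closed, everything after it is discarded, so real input may need a stricter or more forgiving policy.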
Blueprint information
- Status: Complete
- Approver: Gaëtan Delannay
- Priority: Undefined
- Drafter: Gaëtan Delannay
- Direction: Needs approval
- Assignee: Gaëtan Delannay
- Definition: Obsolete
- Series goal: None
- Implementation: Unknown
- Milestone target: None
- Started by:
- Completed by: Gaëtan Delannay
Whiteboard
This will not be done at the pod level. Instead, appy integrates an XHTML cleaner in appy.shared.