Huge thanks to our Platinum Members Endace and LiveAction,
and our Silver Member Veeam, for supporting the Wireshark Foundation and project.

Wireshark-bugs: [Wireshark-bugs] [Bug 2588] Bad UTF-8 string from RSS dissector

Date: Sun, 13 Jul 2008 03:31:55 -0700 (PDT)
https://bugs.wireshark.org/bugzilla/show_bug.cgi?id=2588





--- Comment #10 from Abhik Sarkar <sarkar.abhik@xxxxxxxxx>  2008-07-13 03:31:54 PDT ---
Just some additional thoughts on this one:
One of the UTF-8 characters which is incorrectly printed is: e2 80 9c which is
the "left double quotation mark".

Interestingly, when this part of the data finally comes to print_escaped_xml(),
e2 and 9c are unchanged, but 80 has already been escaped as \x80. I have been
unable to figure out where this got 'escaped'!

The patch I have submitted is (as I have mentioned) a workaround to prevent
non-printable characters from getting into the PDML file. However, this
probably needs more investigation.


-- 
Configure bugmail: https://bugs.wireshark.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.