Commit Graph

59 Commits

Author SHA1 Message Date
James Clark
939ec46566 -1 in unknown encoding lead byte table should use BT_MALFORM 1998-06-03 08:25:08 +00:00
James Clark
0e864d50fb Use ints rather than unsigned shorts in unknown encoding support 1998-06-03 07:52:49 +00:00
James Clark
84835ac373 Generalize unknown encoding support 1998-06-02 08:57:14 +00:00
James Clark
e8ac3d8f5f Fix typos in CHECK_*_CASES args 1998-06-02 04:20:08 +00:00
James Clark
9efc5049a8 Support for arbitrary single byte encodings 1998-06-01 12:12:35 +00:00
James Clark
3a10246912 New approach for handling predefined entities 1998-06-01 07:28:31 +00:00
James Clark
496af54711 Avoid converting data unnecessarily 1998-06-01 05:55:23 +00:00
James Clark
6ff56dc9dc Add XML_ROLE_NOTATION_NO_SYSTEM_ID 1998-06-01 03:59:07 +00:00
James Clark
a2625a0572 Finish UTF16 conversion support 1998-05-31 08:06:01 +00:00
James Clark
c230dddef9 Represent UTF16 as char not UTF16_CHAR 1998-05-30 12:47:06 +00:00
James Clark
583f937102 Change handling of internal encodings in xmltok interface. 1998-05-30 10:13:07 +00:00
James Clark
05fd715a12 Require whitespace between attribute name and following attribute value 1998-05-23 07:27:59 +00:00
James Clark
d504c1c894 Add support for US-ASCII encoding 1998-05-11 10:16:36 +00:00
James Clark
d40b82ff26 Fix bug with end of buffer after ) in prolog 1998-05-02 06:03:28 +00:00
James Clark
fa8b616b9c Don't handle CDATA sections as a single token 1998-05-02 05:31:15 +00:00
James Clark
7e71c8f5d5 Fix bug with entity containing ] or ]] 1998-05-02 03:14:22 +00:00
James Clark
d4ee1a1ee1 Add copyright notice. 1998-04-05 15:11:29 +00:00
James Clark
f9ba46fd43 Bug in Latin 1 support 1998-03-23 23:42:30 +00:00
James Clark
6509a36b3f Get rid of // comments 1998-03-15 12:36:14 +00:00
James Clark
fce5a0395f Define calloc as well. 1998-03-09 01:26:05 +00:00
James Clark
a7e34c37ab Initial Revision 1998-03-08 03:39:56 +00:00
James Clark
34e680d0b7 Include xmldef.h 1998-03-08 03:39:32 +00:00
James Clark
e83509516e Update list of public id characters 1998-02-07 05:36:06 +00:00
James Clark
0a8e3db8c7 Deal with externally specified encodings 1998-02-07 04:53:44 +00:00
James Clark
b1c9b03eba Check for invalid UTF-8 characters 1998-02-03 10:04:03 +00:00
James Clark
2f74758c1f Normalize attribute values. Attribute bug fixes. 1998-02-03 09:09:48 +00:00
James Clark
332b10d36c Add skipS method 1998-02-02 11:24:03 +00:00
James Clark
7cf6cf3188 Change containsRef field in ATTRIBUTE to normalized 1998-02-02 11:15:15 +00:00
James Clark
fdad370c5b Fix bug with UTF16 to UTF8 conversion for chars 0x80..0xFF 1998-02-02 11:14:46 +00:00
James Clark
c70ed036c9 Fix bug in UTF16 to UTF8 conversion 1998-02-02 10:35:37 +00:00
James Clark
3cc8d0075a Fix bug in getAtts with '. 1998-02-02 09:29:51 +00:00
James Clark
2fd2484a0a Use macro to access minBytesPerChar 1998-01-30 19:11:39 +00:00
James Clark
56440b93f3 Fix UTF-16 to UTF-8 conversion on surrogates. 1998-01-22 08:35:51 +00:00
James Clark
2ec75e44bc Fix bug in utf8_encoding with surrogates 1998-01-22 08:22:40 +00:00
James Clark
b07d0e5208 Check that parameter entities are declared. 1997-12-13 06:39:04 +00:00
James Clark
2a4f91e330 Fix bug in checkPiTarget 1997-12-12 09:17:24 +00:00
James Clark
b2eeefe571 New token XML_TOK_XML_DECL 1997-12-12 00:48:27 +00:00
James Clark
db6a73f853 Check for surrogates in NCRs 1997-12-11 14:16:35 +00:00
James Clark
dc8aa626cc Avoid problem with forward reference to const object 1997-12-10 14:19:33 +00:00
James Clark
8b6b5840ee Don't need to check system identifiers 1997-12-10 07:57:10 +00:00
James Clark
9f41c277f9 Changes before release 1997-12-10 07:44:19 +00:00
James Clark
27c6f1b6c5 Initial revision 1997-12-10 07:33:23 +00:00
James Clark
7af19499b0 Fix bug with DTD-less docs. 1997-11-15 05:30:55 +00:00
James Clark
a5be232444 Initial Revision 1997-11-15 01:05:11 +00:00
James Clark
2ad2c72b27 Reorder tokens to group element type declaration only tokens together 1997-11-15 00:51:48 +00:00
James Clark
8a1833a65f Allow #name to be terminated by | 1997-11-14 05:24:37 +00:00
James Clark
a2c83c96b5 Do literals properly. Support for conditional sections. 1997-11-13 11:45:31 +00:00
James Clark
9501138aae Realistic prolog tokenization. 1997-11-13 09:05:46 +00:00
James Clark
9651443ca7 Support for line and column numbers 1997-11-12 10:38:58 +00:00
James Clark
84be77b536 Update comment. 1997-11-11 06:41:38 +00:00