root / tags / 0.5 / CHANGELOG

Revision 107, 2.6 kB (checked in by why, 22 months ago)
  • CHANGELOG: a summary of 0.5.
Line 
1= 0.5
2=== 31rd January, 2006
3
4* support for a[text()="Click Me!"] and h3[text()*="space"] and the like.
5* Hpricot.buffer_size accessor for increasing Hpricot's buffer if you're encountering huge ASP.NET viewstate attribs.
6* some support for colons in tag names (not full namespace support yet.)
7* Element.to_original_html will attempt to preserve the original HTML while merging your changes.
8* Element.to_plain_text converts an element's contents to a simple text format.
9* Element.inner_text removes all tags and returns text nodes concatenated into a single string.
10* no @raw_string variable kept for comments, text, and cdata -- as it's redundant.
11* xpath-style indices (//p/a[1]) but keep in mind that they aren't zero-based.
12* node_position is the index among all sibling nodes, while position is the position among children of identical type.
13* comment() and text() search criteria, like: //p/text(), which selects all text inside paragraph tags.
14* every element has css_path and xpath methods which return respective absolute paths.
15* more flexibility all around: in parsing attributes, tags, comments and cdata.
16
17= 0.4
18=== 11th August, 2006
19
20* The :fixup_tags option will try to sort out the hierarchy so elements end up with the right parents.
21* Elements such as *script* and *style* (identified as having CDATA contents) receive a single text node as their children now.  Previously, Hpricot was parsing out tags found in scripts.
22* Better scanning of partially quoted attributes (found by Brent Beardsly on http://uswebgen.com/)
23* Better scanning of unquoted attributes -- thanks to Aaron Patterson for the test cases!
24* Some tags were being output in the empty tag style, although browsers hated that.  FIXED!
25* Added Elements#at for finding single elements.
26* Added Elem::Trav#[] and Elem::Trav#[]= for reading and writing attributes.
27
28= 0.3
29=== 7th July, 2006
30
31* Fixed negative string size error on empty tokens. (news.bbc.co.uk)
32* Allow the parser to accept just text nodes. (such as: <tt>Hpricot.parse('TEXT')</tt>)
33* from JQuery to Hpricot::Elements: remove, empty, append, prepend, before, after, wrap, set,
34  html(...), to_html, to_s.
35* on containers: to_html, replace_child, insert_before, insert_after, innerHTML=.
36* Hpricot(...) is an alias for parse.
37* open up all properties to setters, let people do as they may.
38* use to_html for the full html of a node or set of elements.
39* doctypes were messed.
40
41= 0.2
42=== 4th July, 2006
43
44* Rewrote the HTree parser to be simpler, more adequate for the common man.  Will add encoding back in later.
45
46= 0.1
47=== 3rd July, 2006
48
49* For whatever reason, wrote this HTML parser in C.
50  I guess Ragel is addictive and I want to improve HTree.
Note: See TracBrowser for help on using the browser.