Nokogiri::HTML::SAX::Parser

class Nokogiri::HTML::SAX::Parser

Parent:
Nokogiri::XML::SAX::Parser

This class lets you perform SAX-style parsing on HTML, with HTML error correction applied as the document is read.

Here is a basic usage example:

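require 'nokogiri'

# Handler whose callbacks fire as the parser walks the HTML
class MyDoc < Nokogiri::XML::SAX::Document
  # Called once for each opening tag
  def start_element name, attributes = []
    puts "found a #{name}"
  end
end

parser = Nokogiri::HTML::SAX::Parser.new(MyDoc.new)
# Read the file named by the first command-line argument as binary data
parser.parse(File.read(ARGV[0], mode: 'rb'))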

For more information on SAX parsers, see Nokogiri::XML::SAX.

Public Instance Methods

parse_file(filename, encoding = 'UTF-8') { |ctx| ... }

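Parse the file at filename using encoding. Raises ArgumentError if filename is nil, Errno::ENOENT if the file does not exist, and Errno::EISDIR if it is a directory. If a block is given, the parser context is yielded before parsing starts.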

# File lib/nokogiri/html/sax/parser.rb, line 41
def parse_file filename, encoding = 'UTF-8'
  raise ArgumentError unless filename
  raise Errno::ENOENT unless File.exist?(filename)
  raise Errno::EISDIR if File.directory?(filename)
  ctx = ParserContext.file(filename, encoding)
  yield ctx if block_given?
  ctx.parse_with self
end

parse_memory(data, encoding = 'UTF-8') { |ctx| ... }

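Parse HTML stored in data using encoding. Raises ArgumentError if data is nil and returns without parsing if data is empty. If a block is given, the parser context is yielded before parsing starts.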

# File lib/nokogiri/html/sax/parser.rb, line 31
def parse_memory data, encoding = 'UTF-8'
  raise ArgumentError unless data
  return unless data.length > 0
  ctx = ParserContext.memory(data, encoding)
  yield ctx if block_given?
  ctx.parse_with self
end

© 2008–2016 Aaron Patterson, Mike Dalessio, Charles Nutter, Sergio Arbeo
Patrick Mahoney, Yoko Harada, Akinori Musha, John Shahid
Licensed under the MIT License.
