<div class="section-body">
<ul>
- <li><a href="./test/tc_porter2_full_rb.html?TB_iframe=true&height=550&width=785"
- class="thickbox" title="test/tc_porter2_full.rb">test/tc_porter2_full.rb</a></li>
-
<li><a href="./lib/porter2_rb.html?TB_iframe=true&height=550&width=785"
class="thickbox" title="lib/porter2.rb">lib/porter2.rb</a></li>
<!-- Method Quickref -->
-
- <!-- Included Modules -->
-
- <div id="includes-section" class="section">
- <h3 class="section-header">Included Modules</h3>
+ <div id="method-list-section" class="section">
+ <h3 class="section-header">Methods</h3>
<ul class="link-list">
-
-
- <li><a class="include" href="Stemmable.html">Stemmable</a></li>
-
-
+
+ <li><a href="#method-i-porter2_ends_with_short_syllable%3F">#porter2_ends_with_short_syllable?</a></li>
+
+ <li><a href="#method-i-porter2_is_short_word%3F">#porter2_is_short_word?</a></li>
+
+ <li><a href="#method-i-porter2_postprocess">#porter2_postprocess</a></li>
+
+ <li><a href="#method-i-porter2_preprocess">#porter2_preprocess</a></li>
+
+ <li><a href="#method-i-porter2_r1">#porter2_r1</a></li>
+
+ <li><a href="#method-i-porter2_r2">#porter2_r2</a></li>
+
+ <li><a href="#method-i-porter2_stem">#porter2_stem</a></li>
+
+ <li><a href="#method-i-porter2_stem_verbose">#porter2_stem_verbose</a></li>
+
+ <li><a href="#method-i-porter2_step0">#porter2_step0</a></li>
+
+ <li><a href="#method-i-porter2_step1a">#porter2_step1a</a></li>
+
+ <li><a href="#method-i-porter2_step1b">#porter2_step1b</a></li>
+
+ <li><a href="#method-i-porter2_step1c">#porter2_step1c</a></li>
+
+ <li><a href="#method-i-porter2_step2">#porter2_step2</a></li>
+
+ <li><a href="#method-i-porter2_step3">#porter2_step3</a></li>
+
+ <li><a href="#method-i-porter2_step4">#porter2_step4</a></li>
+
+ <li><a href="#method-i-porter2_step5">#porter2_step5</a></li>
+
+ <li><a href="#method-i-porter2_tidy">#porter2_tidy</a></li>
+
+ <li><a href="#method-i-stem">#stem</a></li>
+
</ul>
</div>
+
+ <!-- Included Modules -->
+
</div>
<div id="project-metadata">
<ul class="link-list">
- <li><a href="./Stemmable.html">Stemmable</a></li>
+ <li><a href="./Porter2.html">Porter2</a></li>
<li><a href="./String.html">String</a></li>
<h1 class="class">String</h1>
<div id="description">
- <p>
-Add stem method to all Strings\r
+ <h2>The Porter 2 stemmer</h2>
+<p>
+This is the Porter 2 stemming algorithm, as described at <a
+href="http://snowball.tartarus.org/algorithms/english/stemmer.html">snowball.tartarus.org/algorithms/english/stemmer.html</a>
+The original paper is:
+</p>
+<p>
+Porter, 1980, “An algorithm for suffix stripping”,
+<em>Program</em>, Vol. 14, no. 3, pp 130-137
+</p>
+<p>
+Constants for the stemmer are in the <a href="Porter2.html">Porter2</a>
+module.
+</p>
+<p>
+Procedures that implement the stemmer are added to the <a
+href="String.html">String</a> class.
+</p>
+<p>
+The stemmer algorithm is implemented in the <a
+href="String.html#method-i-porter2_stem">porter2_stem</a> procedure.
+</p>
+<h2>Internationalisation</h2>
+<p>
+There isn’t much, as this is a stemmer that only works for English.
+</p>
+<p>
+The <tt>gb_english</tt> flag to the various procedures allows the stemmer
+to treat the British English ’-ise’ the same as the American
+English ’-ize’.
+</p>
+<h2>Longest suffixes</h2>
+<p>
+Several places in the algorithm require matching the longest suffix of a
+word. The regexp engine in Ruby 1.9 seems to handle alterntives in regexps
+by finding the alternative that matches at the first position in the
+string. As we’re only talking about suffixes, that first match is
+also the longest suffix. If the regexp engine changes, this behaviour may
+change and break the stemmer.
</p>
</div>
<!-- Methods -->
+ <div id="public-instance-method-details" class="method-section section">
+ <h3 class="section-header">Public Instance Methods</h3>
+
+
+ <div id="porter-ends-with-short-syllable--method" class="method-detail ">
+ <a name="method-i-porter2_ends_with_short_syllable%3F"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_ends_with_short_syllable?</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Returns true if the word ends with a short syllable
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-ends-with-short-syllable--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 87</span>
+87: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_ends_with_short_syllable?</span>
+88: <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::SHORT_SYLLABLE}$/</span> <span class="ruby-operator">?</span> <span class="ruby-keyword kw">true</span> <span class="ruby-operator">:</span> <span class="ruby-keyword kw">false</span>
+89: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-is-short-word--method" class="method-detail ">
+ <a name="method-i-porter2_is_short_word%3F"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_is_short_word?</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+A word is short if it ends in a short syllable, and R1 is null
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-is-short-word--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 93</span>
+93: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_is_short_word?</span>
+94: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_ends_with_short_syllable?</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span>.<span class="ruby-identifier">empty?</span>
+95: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-postprocess-method" class="method-detail ">
+ <a name="method-i-porter2_postprocess"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_postprocess</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Turn all Y letters into y
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-postprocess-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 289</span>
+289: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_postprocess</span>
+290: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">gsub</span>(<span class="ruby-regexp re">/Y/</span>, <span class="ruby-value str">'y'</span>)
+291: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-preprocess-method" class="method-detail ">
+ <a name="method-i-porter2_preprocess"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_preprocess</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Preprocess the word. Remove any initial ’, if present. Then, set
+initial y, or y after a vowel, to Y
+</p>
+<p>
+(The comment to ‘establish the regions R1 and R2’ in the
+original description is an implementation optimisation that identifies
+where the regions start. As no modifications are made to the word that
+affect those positions, you may want to cache them now. This implementation
+doesn’t do that.)
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-preprocess-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 53</span>
+53: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_preprocess</span>
+54: <span class="ruby-identifier">w</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">dup</span>
+55:
+56: <span class="ruby-comment cmt"># remove any initial apostrophe</span>
+57: <span class="ruby-identifier">w</span>.<span class="ruby-identifier">gsub!</span>(<span class="ruby-regexp re">/^'*(.)/</span>, <span class="ruby-value str">'\1'</span>)
+58:
+59: <span class="ruby-comment cmt"># set initial y, or y after a vowel, to Y</span>
+60: <span class="ruby-identifier">w</span>.<span class="ruby-identifier">gsub!</span>(<span class="ruby-regexp re">/^y/</span>, <span class="ruby-value str">"Y"</span>)
+61: <span class="ruby-identifier">w</span>.<span class="ruby-identifier">gsub!</span>(<span class="ruby-node">/(#{Porter2::V})y/</span>, <span class="ruby-value str">'\1Y'</span>)
+62:
+63: <span class="ruby-identifier">w</span>
+64: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-r--method" class="method-detail ">
+ <a name="method-i-porter2_r1"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_r1</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+R1 is the portion of the word after the first non-vowel after the first
+vowel (with words beginning ‘gener-’, ‘commun-’,
+and ‘arsen-’ treated as special cases
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-r--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 69</span>
+69: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_r1</span>
+70: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/^(gener|commun|arsen)(?<r1>.*)/</span>
+71: <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">last_match</span>(<span class="ruby-value">:r1</span>)
+72: <span class="ruby-keyword kw">else</span>
+73: <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::V}#{Porter2::C}(?<r1>.*)$/</span>
+74: <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">last_match</span>(<span class="ruby-value">:r1</span>) <span class="ruby-operator">||</span> <span class="ruby-value str">""</span>
+75: <span class="ruby-keyword kw">end</span>
+76: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-r--method" class="method-detail ">
+ <a name="method-i-porter2_r2"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_r2</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+R2 is the portion of R1 (<a
+href="String.html#method-i-porter2_r1">porter2_r1</a>) after the first
+non-vowel after the first vowel
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-r--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 80</span>
+80: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_r2</span>
+81: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::V}#{Porter2::C}(?<r2>.*)$/</span>
+82: <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">last_match</span>(<span class="ruby-value">:r2</span>) <span class="ruby-operator">||</span> <span class="ruby-value str">""</span>
+83: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-stem-method" class="method-detail ">
+ <a name="method-i-porter2_stem"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_stem</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Perform the stemming procedure. If <tt>gb_english</tt> is true, treat
+’-ise’ and similar suffixes as ’-ize’ in American
+English.
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-stem-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 297</span>
+297: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_stem</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+298: <span class="ruby-identifier">preword</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_tidy</span>
+299: <span class="ruby-keyword kw">return</span> <span class="ruby-identifier">preword</span> <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">length</span> <span class="ruby-operator"><=</span> <span class="ruby-value">2</span>
+300:
+301: <span class="ruby-identifier">word</span> = <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">porter2_preprocess</span>
+302:
+303: <span class="ruby-keyword kw">if</span> <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">SPECIAL_CASES</span>.<span class="ruby-identifier">has_key?</span> <span class="ruby-identifier">word</span>
+304: <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">SPECIAL_CASES</span>[<span class="ruby-identifier">word</span>]
+305: <span class="ruby-keyword kw">else</span>
+306: <span class="ruby-identifier">w1a</span> = <span class="ruby-identifier">word</span>.<span class="ruby-identifier">porter2_step0</span>.<span class="ruby-identifier">porter2_step1a</span>
+307: <span class="ruby-keyword kw">if</span> <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">STEP_1A_SPECIAL_CASES</span>.<span class="ruby-identifier">include?</span> <span class="ruby-identifier">w1a</span>
+308: <span class="ruby-identifier">w1a</span>
+309: <span class="ruby-keyword kw">else</span>
+310: <span class="ruby-identifier">w1a</span>.<span class="ruby-identifier">porter2_step1b</span>(<span class="ruby-identifier">gb_english</span>).<span class="ruby-identifier">porter2_step1c</span>.<span class="ruby-identifier">porter2_step2</span>(<span class="ruby-identifier">gb_english</span>).<span class="ruby-identifier">porter2_step3</span>(<span class="ruby-identifier">gb_english</span>).<span class="ruby-identifier">porter2_step4</span>(<span class="ruby-identifier">gb_english</span>).<span class="ruby-identifier">porter2_step5</span>.<span class="ruby-identifier">porter2_postprocess</span>
+311: <span class="ruby-keyword kw">end</span>
+312: <span class="ruby-keyword kw">end</span>
+313: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+ <div class="aliases">
+ Also aliased as: <a href="String.html#method-i-stem">stem</a>
+ </div>
+
+
+
+ </div>
+
+
+ <div id="porter-stem-verbose-method" class="method-detail ">
+ <a name="method-i-porter2_stem_verbose"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_stem_verbose</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+A verbose version of <a
+href="String.html#method-i-porter2_stem">porter2_stem</a> that prints the
+output of each stage to STDOUT
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-stem-verbose-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 316</span>
+316: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_stem_verbose</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+317: <span class="ruby-identifier">preword</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_tidy</span>
+318: <span class="ruby-identifier">puts</span> <span class="ruby-node">"Preword: #{preword}"</span>
+319: <span class="ruby-keyword kw">return</span> <span class="ruby-identifier">preword</span> <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">length</span> <span class="ruby-operator"><=</span> <span class="ruby-value">2</span>
+320:
+321: <span class="ruby-identifier">word</span> = <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">porter2_preprocess</span>
+322: <span class="ruby-identifier">puts</span> <span class="ruby-node">"Preprocessed: #{word}"</span>
+323:
+324: <span class="ruby-keyword kw">if</span> <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">SPECIAL_CASES</span>.<span class="ruby-identifier">has_key?</span> <span class="ruby-identifier">word</span>
+325: <span class="ruby-identifier">puts</span> <span class="ruby-node">"Returning #{word} as special case #{Porter2::SPECIAL_CASES[word]}"</span>
+326: <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">SPECIAL_CASES</span>[<span class="ruby-identifier">word</span>]
+327: <span class="ruby-keyword kw">else</span>
+328: <span class="ruby-identifier">r1</span> = <span class="ruby-identifier">word</span>.<span class="ruby-identifier">porter2_r1</span>
+329: <span class="ruby-identifier">r2</span> = <span class="ruby-identifier">word</span>.<span class="ruby-identifier">porter2_r2</span>
+330: <span class="ruby-identifier">puts</span> <span class="ruby-node">"R1 = #{r1}, R2 = #{r2}"</span>
+331:
+332: <span class="ruby-identifier">w0</span> = <span class="ruby-identifier">word</span>.<span class="ruby-identifier">porter2_step0</span> ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 0: #{w0} (R1 = #{w0.porter2_r1}, R2 = #{w0.porter2_r2})"</span>
+333: <span class="ruby-identifier">w1a</span> = <span class="ruby-identifier">w0</span>.<span class="ruby-identifier">porter2_step1a</span> ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 1a: #{w1a} (R1 = #{w1a.porter2_r1}, R2 = #{w1a.porter2_r2})"</span>
+334:
+335: <span class="ruby-keyword kw">if</span> <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">STEP_1A_SPECIAL_CASES</span>.<span class="ruby-identifier">include?</span> <span class="ruby-identifier">w1a</span>
+336: <span class="ruby-identifier">puts</span> <span class="ruby-node">"Returning #{w1a} as 1a special case"</span>
+337: <span class="ruby-identifier">w1a</span>
+338: <span class="ruby-keyword kw">else</span>
+339: <span class="ruby-identifier">w1b</span> = <span class="ruby-identifier">w1a</span>.<span class="ruby-identifier">porter2_step1b</span>(<span class="ruby-identifier">gb_english</span>) ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 1b: #{w1b} (R1 = #{w1b.porter2_r1}, R2 = #{w1b.porter2_r2})"</span>
+340: <span class="ruby-identifier">w1c</span> = <span class="ruby-identifier">w1b</span>.<span class="ruby-identifier">porter2_step1c</span> ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 1c: #{w1c} (R1 = #{w1c.porter2_r1}, R2 = #{w1c.porter2_r2})"</span>
+341: <span class="ruby-identifier">w2</span> = <span class="ruby-identifier">w1c</span>.<span class="ruby-identifier">porter2_step2</span>(<span class="ruby-identifier">gb_english</span>) ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 2: #{w2} (R1 = #{w2.porter2_r1}, R2 = #{w2.porter2_r2})"</span>
+342: <span class="ruby-identifier">w3</span> = <span class="ruby-identifier">w2</span>.<span class="ruby-identifier">porter2_step3</span>(<span class="ruby-identifier">gb_english</span>) ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 3: #{w3} (R1 = #{w3.porter2_r1}, R2 = #{w3.porter2_r2})"</span>
+343: <span class="ruby-identifier">w4</span> = <span class="ruby-identifier">w3</span>.<span class="ruby-identifier">porter2_step4</span>(<span class="ruby-identifier">gb_english</span>) ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 4: #{w4} (R1 = #{w4.porter2_r1}, R2 = #{w4.porter2_r2})"</span>
+344: <span class="ruby-identifier">w5</span> = <span class="ruby-identifier">w4</span>.<span class="ruby-identifier">porter2_step5</span> ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After step 5: #{w5}"</span>
+345: <span class="ruby-identifier">wpost</span> = <span class="ruby-identifier">w5</span>.<span class="ruby-identifier">porter2_postprocess</span> ; <span class="ruby-identifier">puts</span> <span class="ruby-node">"After postprocess: #{wpost}"</span>
+346: <span class="ruby-identifier">wpost</span>
+347: <span class="ruby-keyword kw">end</span>
+348: <span class="ruby-keyword kw">end</span>
+349: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step--method" class="method-detail ">
+ <a name="method-i-porter2_step0"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step0</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the suffixes,
+</p>
+<ul>
+<li><p>
+‘
+</p>
+</li>
+<li><p>
+’s
+</p>
+</li>
+<li><p>
+’s’
+</p>
+</li>
+</ul>
+<p>
+and remove if found.
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 103</span>
+103: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step0</span>
+104: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub!</span>(<span class="ruby-regexp re">/(.)('s'|'s|')$/</span>, <span class="ruby-value str">'\1'</span>) <span class="ruby-operator">||</span> <span class="ruby-keyword kw">self</span>
+105: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step-a-method" class="method-detail ">
+ <a name="method-i-porter2_step1a"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step1a</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the following suffixes, and perform the action
+indicated.
+</p>
+<table>
+<tr><td valign="top">sses</td><td><p>
+replace by ss
+</p>
+</td></tr>
+<tr><td valign="top">ied, ies</td><td><p>
+replace by i if preceded by more than one letter, otherwise by ie
+</p>
+</td></tr>
+<tr><td valign="top">s</td><td><p>
+delete if the preceding word part contains a vowel not immediately before
+the s
+</p>
+</td></tr>
+<tr><td valign="top">us, ss</td><td><p>
+do nothing
+</p>
+</td></tr>
+</table>
+
+
+
+ <div class="method-source-code"
+ id="porter-step-a-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 113</span>
+113: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step1a</span>
+114: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/sses$/</span>
+115: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/sses$/</span>, <span class="ruby-value str">'ss'</span>)
+116: <span class="ruby-keyword kw">elsif</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/..(ied|ies)$/</span>
+117: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/(ied|ies)$/</span>, <span class="ruby-value str">'i'</span>)
+118: <span class="ruby-keyword kw">elsif</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(ied|ies)$/</span>
+119: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/(ied|ies)$/</span>, <span class="ruby-value str">'ie'</span>)
+120: <span class="ruby-keyword kw">elsif</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(us|ss)$/</span>
+121: <span class="ruby-keyword kw">self</span>
+122: <span class="ruby-keyword kw">elsif</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/s$/</span>
+123: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/(#{Porter2::V}.+)s$/</span>
+124: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/s$/</span>, <span class="ruby-value str">''</span>)
+125: <span class="ruby-keyword kw">else</span>
+126: <span class="ruby-keyword kw">self</span>
+127: <span class="ruby-keyword kw">end</span>
+128: <span class="ruby-keyword kw">else</span>
+129: <span class="ruby-keyword kw">self</span>
+130: <span class="ruby-keyword kw">end</span>
+131: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step-b-method" class="method-detail ">
+ <a name="method-i-porter2_step1b"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step1b</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the following suffixes, and perform the action
+indicated.
+</p>
+<table>
+<tr><td valign="top">eed, eedly</td><td><p>
+replace by ee if the suffix is also in R1
+</p>
+</td></tr>
+<tr><td valign="top">ed, edly, ing, ingly</td><td><p>
+delete if the preceding word part contains a vowel and, after the
+deletion:
+</p>
+<ul>
+<li><p>
+if the word ends at, bl or iz: add e, or
+</p>
+</li>
+</ul>
+<ul>
+<li><p>
+if the word ends with a double: remove the last letter, or
+</p>
+</li>
+</ul>
+<ul>
+<li><p>
+if the word is short: add e
+</p>
+</li>
+</ul>
+</td></tr>
+</table>
+<p>
+(If gb_english is <tt>true</tt>, treat the ‘is’ suffix as
+‘iz’ above.)
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step-b-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 143</span>
+143: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step1b</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+144: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(eed|eedly)$/</span>
+145: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(eed|eedly)$/</span>
+146: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/(eed|eedly)$/</span>, <span class="ruby-value str">'ee'</span>)
+147: <span class="ruby-keyword kw">else</span>
+148: <span class="ruby-keyword kw">self</span>
+149: <span class="ruby-keyword kw">end</span>
+150: <span class="ruby-keyword kw">else</span>
+151: <span class="ruby-identifier">w</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">dup</span>
+152: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">w</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::V}.*(ed|edly|ing|ingly)$/</span>
+153: <span class="ruby-identifier">w</span>.<span class="ruby-identifier">sub!</span>(<span class="ruby-regexp re">/(ed|edly|ing|ingly)$/</span>, <span class="ruby-value str">''</span>)
+154: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">w</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(at|lb|iz)$/</span>
+155: <span class="ruby-identifier">w</span> <span class="ruby-operator">+=</span> <span class="ruby-value str">'e'</span>
+156: <span class="ruby-keyword kw">elsif</span> <span class="ruby-identifier">w</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/is$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-identifier">gb_english</span>
+157: <span class="ruby-identifier">w</span> <span class="ruby-operator">+=</span> <span class="ruby-value str">'e'</span>
+158: <span class="ruby-keyword kw">elsif</span> <span class="ruby-identifier">w</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::Double}$/</span>
+159: <span class="ruby-identifier">w</span>.<span class="ruby-identifier">chop!</span>
+160: <span class="ruby-keyword kw">elsif</span> <span class="ruby-identifier">w</span>.<span class="ruby-identifier">porter2_is_short_word?</span>
+161: <span class="ruby-identifier">w</span> <span class="ruby-operator">+=</span> <span class="ruby-value str">'e'</span>
+162: <span class="ruby-keyword kw">end</span>
+163: <span class="ruby-keyword kw">end</span>
+164: <span class="ruby-identifier">w</span>
+165: <span class="ruby-keyword kw">end</span>
+166: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step-c-method" class="method-detail ">
+ <a name="method-i-porter2_step1c"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step1c</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Replace a suffix of y or Y by i if it is preceded by a non-vowel which is
+not the first letter of the word.
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step-c-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 171</span>
+171: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step1c</span>
+172: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/.+#{Porter2::C}(y|Y)$/</span>
+173: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/(y|Y)$/</span>, <span class="ruby-value str">'i'</span>)
+174: <span class="ruby-keyword kw">else</span>
+175: <span class="ruby-keyword kw">self</span>
+176: <span class="ruby-keyword kw">end</span>
+177: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step--method" class="method-detail ">
+ <a name="method-i-porter2_step2"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step2</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the suffixes listed in the keys of
+Porter2::STEP_2_MAPS. If one is found and that suffix occurs in R1,
+replace it with the value found in STEP_2_MAPS.
+</p>
+<p>
+(Suffixes ‘ogi’ and ‘li’ are treated as special
+cases in the procedure.)
+</p>
+<p>
+(If gb_english is <tt>true</tt>, replace the ‘iser’ and
+‘isation’ suffixes with ‘ise’, similarly to how
+‘izer’ and ‘ization’ are treated.)
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 188</span>
+188: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step2</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+189: <span class="ruby-identifier">r1</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span>
+190: <span class="ruby-identifier">s2m</span> = <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">STEP_2_MAPS</span>.<span class="ruby-identifier">dup</span>
+191: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">gb_english</span>
+192: <span class="ruby-identifier">s2m</span>[<span class="ruby-value str">"iser"</span>] = <span class="ruby-value str">"ise"</span>
+193: <span class="ruby-identifier">s2m</span>[<span class="ruby-value str">"isation"</span>] = <span class="ruby-value str">"ise"</span>
+194: <span class="ruby-keyword kw">end</span>
+195: <span class="ruby-identifier">step_2_re</span> = <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">union</span>(<span class="ruby-identifier">s2m</span>.<span class="ruby-identifier">keys</span>.<span class="ruby-identifier">map</span> {<span class="ruby-operator">|</span><span class="ruby-identifier">r</span><span class="ruby-operator">|</span> <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">new</span>(<span class="ruby-identifier">r</span> <span class="ruby-operator">+</span> <span class="ruby-value str">"$"</span>)})
+196: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-identifier">step_2_re</span>
+197: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">r1</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{$&}$/</span>
+198: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-node">/#{$&}$/</span>, <span class="ruby-identifier">s2m</span>[<span class="ruby-node">$&</span>])
+199: <span class="ruby-keyword kw">else</span>
+200: <span class="ruby-keyword kw">self</span>
+201: <span class="ruby-keyword kw">end</span>
+202: <span class="ruby-keyword kw">elsif</span> <span class="ruby-identifier">r1</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/li$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/(#{Porter2::Valid_LI})li$/</span>
+203: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/li$/</span>, <span class="ruby-value str">''</span>)
+204: <span class="ruby-keyword kw">elsif</span> <span class="ruby-identifier">r1</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/ogi$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/logi$/</span>
+205: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/ogi$/</span>, <span class="ruby-value str">'og'</span>)
+206: <span class="ruby-keyword kw">else</span>
+207: <span class="ruby-keyword kw">self</span>
+208: <span class="ruby-keyword kw">end</span>
+209: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step--method" class="method-detail ">
+ <a name="method-i-porter2_step3"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step3</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the suffixes listed in the keys of
+Porter2::STEP_3_MAPS. If one is found and that suffix occurs in R1,
+replace it with the value found in STEP_3_MAPS.
+</p>
+<p>
+(Suffix ‘ative’ is treated as a special case in the procedure.)
+</p>
+<p>
+(If gb_english is <tt>true</tt>, replace the ‘alise’ suffix
+with ‘al’, similarly to how ‘alize’ is treated.)
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 220</span>
+220: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step3</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+221: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/ative$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r2</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/ative$/</span>
+222: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/ative$/</span>, <span class="ruby-value str">''</span>)
+223: <span class="ruby-keyword kw">else</span>
+224: <span class="ruby-identifier">s3m</span> = <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">STEP_3_MAPS</span>.<span class="ruby-identifier">dup</span>
+225: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">gb_english</span>
+226: <span class="ruby-identifier">s3m</span>[<span class="ruby-value str">"alise"</span>] = <span class="ruby-value str">"al"</span>
+227: <span class="ruby-keyword kw">end</span>
+228: <span class="ruby-identifier">step_3_re</span> = <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">union</span>(<span class="ruby-identifier">s3m</span>.<span class="ruby-identifier">keys</span>.<span class="ruby-identifier">map</span> {<span class="ruby-operator">|</span><span class="ruby-identifier">r</span><span class="ruby-operator">|</span> <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">new</span>(<span class="ruby-identifier">r</span> <span class="ruby-operator">+</span> <span class="ruby-value str">"$"</span>)})
+229: <span class="ruby-identifier">r1</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span>
+230: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-identifier">step_3_re</span> <span class="ruby-keyword kw">and</span> <span class="ruby-identifier">r1</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{$&}$/</span>
+231: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-node">/#{$&}$/</span>, <span class="ruby-identifier">s3m</span>[<span class="ruby-node">$&</span>])
+232: <span class="ruby-keyword kw">else</span>
+233: <span class="ruby-keyword kw">self</span>
+234: <span class="ruby-keyword kw">end</span>
+235: <span class="ruby-keyword kw">end</span>
+236: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step--method" class="method-detail ">
+ <a name="method-i-porter2_step4"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step4</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the longest among the suffixes listed in the keys of
+Porter2::STEP_4_MAPS. If one is found and that suffix occurs in R2,
+replace it with the value found in STEP_4_MAPS.
+</p>
+<p>
+(Suffix ‘ion’ is treated as a special case in the procedure.)
+</p>
+<p>
+(If gb_english is <tt>true</tt>, delete the ‘ise’ suffix if
+found.)
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-step--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 246</span>
+246: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step4</span>(<span class="ruby-identifier">gb_english</span> = <span class="ruby-keyword kw">false</span>)
+247: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r2</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/ion$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/(s|t)ion$/</span>
+248: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/ion$/</span>, <span class="ruby-value str">''</span>)
+249: <span class="ruby-keyword kw">else</span>
+250: <span class="ruby-identifier">s4m</span> = <span class="ruby-constant">Porter2</span><span class="ruby-operator">::</span><span class="ruby-constant">STEP_4_MAPS</span>.<span class="ruby-identifier">dup</span>
+251: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">gb_english</span>
+252: <span class="ruby-identifier">s4m</span>[<span class="ruby-value str">"ise"</span>] = <span class="ruby-value str">""</span>
+253: <span class="ruby-keyword kw">end</span>
+254: <span class="ruby-identifier">step_4_re</span> = <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">union</span>(<span class="ruby-identifier">s4m</span>.<span class="ruby-identifier">keys</span>.<span class="ruby-identifier">map</span> {<span class="ruby-operator">|</span><span class="ruby-identifier">r</span><span class="ruby-operator">|</span> <span class="ruby-constant">Regexp</span>.<span class="ruby-identifier">new</span>(<span class="ruby-identifier">r</span> <span class="ruby-operator">+</span> <span class="ruby-value str">"$"</span>)})
+255: <span class="ruby-identifier">r2</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r2</span>
+256: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-identifier">step_4_re</span>
+257: <span class="ruby-keyword kw">if</span> <span class="ruby-identifier">r2</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{$&}/</span>
+258: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-node">/#{$&}$/</span>, <span class="ruby-identifier">s4m</span>[<span class="ruby-node">$&</span>])
+259: <span class="ruby-keyword kw">else</span>
+260: <span class="ruby-keyword kw">self</span>
+261: <span class="ruby-keyword kw">end</span>
+262: <span class="ruby-keyword kw">else</span>
+263: <span class="ruby-keyword kw">self</span>
+264: <span class="ruby-keyword kw">end</span>
+265: <span class="ruby-keyword kw">end</span>
+266: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-step--method" class="method-detail ">
+ <a name="method-i-porter2_step5"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_step5</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Search for the the following suffixes, and, if found, perform the action
+indicated.
+</p>
+<table>
+<tr><td valign="top">e</td><td><p>
+delete if in R2, or in R1 and not preceded by a short syllable
+</p>
+</td></tr>
+<tr><td valign="top">l</td><td><p>
+delete if in R2 and preceded by l
+</p>
+</td></tr>
+</table>
+
+
+
+ <div class="method-source-code"
+ id="porter-step--source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 272</span>
+272: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_step5</span>
+273: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/ll$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r2</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/l$/</span>
+274: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/ll$/</span>, <span class="ruby-value str">'l'</span>)
+275: <span class="ruby-keyword kw">elsif</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/e$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r2</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/e$/</span>
+276: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/e$/</span>, <span class="ruby-value str">''</span>)
+277: <span class="ruby-keyword kw">else</span>
+278: <span class="ruby-identifier">r1</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">porter2_r1</span>
+279: <span class="ruby-keyword kw">if</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/e$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-identifier">r1</span> <span class="ruby-operator">=~</span> <span class="ruby-regexp re">/e$/</span> <span class="ruby-keyword kw">and</span> <span class="ruby-keyword kw">not</span> <span class="ruby-keyword kw">self</span> <span class="ruby-operator">=~</span> <span class="ruby-node">/#{Porter2::SHORT_SYLLABLE}e$/</span>
+280: <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">sub</span>(<span class="ruby-regexp re">/e$/</span>, <span class="ruby-value str">''</span>)
+281: <span class="ruby-keyword kw">else</span>
+282: <span class="ruby-keyword kw">self</span>
+283: <span class="ruby-keyword kw">end</span>
+284: <span class="ruby-keyword kw">end</span>
+285: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="porter-tidy-method" class="method-detail ">
+ <a name="method-i-porter2_tidy"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">porter2_tidy</span><span
+ class="method-args">()</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+ <p>
+Tidy up the word before we get down to the algorithm
+</p>
+
+
+
+ <div class="method-source-code"
+ id="porter-tidy-source">
+<pre>
+ <span class="ruby-comment cmt"># File lib/porter2.rb, line 35</span>
+35: <span class="ruby-keyword kw">def</span> <span class="ruby-identifier">porter2_tidy</span>
+36: <span class="ruby-identifier">preword</span> = <span class="ruby-keyword kw">self</span>.<span class="ruby-identifier">to_s</span>.<span class="ruby-identifier">strip</span>.<span class="ruby-identifier">downcase</span>
+37:
+38: <span class="ruby-comment cmt"># map apostrophe-like characters to apostrophes</span>
+39: <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">gsub!</span>(<span class="ruby-regexp re">/‘/</span>, <span class="ruby-value str">"'"</span>)
+40: <span class="ruby-identifier">preword</span>.<span class="ruby-identifier">gsub!</span>(<span class="ruby-regexp re">/’/</span>, <span class="ruby-value str">"'"</span>)
+41:
+42: <span class="ruby-identifier">preword</span>
+43: <span class="ruby-keyword kw">end</span></pre>
+ </div>
+
+ </div>
+
+
+
+
+ </div>
+
+
+ <div id="stem-method" class="method-detail method-alias">
+ <a name="method-i-stem"></a>
+
+ <div class="method-heading">
+
+ <span class="method-name">stem</span><span
+ class="method-args">(gb_english = false)</span>
+ <span class="method-click-advice">click to toggle source</span>
+
+ </div>
+
+ <div class="method-description">
+
+
+
+
+
+ </div>
+
+
+
+
+ <div class="aliases">
+ Alias for: <a href="String.html#method-i-porter2_stem">porter2_stem</a>
+ </div>
+
+ </div>
+
+
+ </div>
+
</div>