|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectcodesimian.CS<CSGeneric>
codesimian.DefaultCS
codesimian.MostCommonSubstrings
public class MostCommonSubstrings
The purpose of this class is to keep statistics on the most common
substrings of random samples of text found in codesimian at random times,
and from the internet and possibly other sources,
and to use those statistics to parse natural-language better.
Example: mostCommonSubstrings(list(' ' 'a' 'b' 'common substrings'...)
list(.17 .05 .03 .001...) optional_changeSpeedFraction optional_maxListSize
'optionally add all short substrings of this text to P0 and P1')
MostCommonSubstrings has 2 lists (in P0 and P1). One is a list of strings,
ordered more common at lower index, and the other is a list of their
relative frequencies that sums to 1 (except for roundoff error).
For any normal english text, the P0 list will start with space, vowels, other letters,
then combinations of 1 letter and 1 space, then combinations of 2 letters,
combinations of 3 letters. Common phrases can be before rare words despite being longer.
P2 is a fraction between 0 and 1 but should be very close to 0
for best statistics (example: .0001). Use a smaller P2 if you use more input text.
P2 is how fast the statistics will change with new text input.
If P2 is 0, P3 is ignored.
If P2 is 1, P3 overwrites all existing statistics of this MostCommonSubstrings.
If P2 is very close to 0, the statistics slowly and accurately learn.
Default is .001, if there is no P2.
P3 is max P(0).countP(), which equals P(1).countP().
This is the total quantity of unique strings that this MostCommonSubstrings keeps statistics on.
Default is 1000, if there is no P3.
P4 is input text, and is input every time this MostCommonSubstrings is EXECUTED.
Default is to not input any more text, if there is no P4.
Field Summary | |
---|---|
static double |
defaultChangeSpeedFraction
P2 |
static double |
defaultMaxListSize
P3 |
Fields inherited from class codesimian.CS |
---|
DESCRIPTION, END, EXECPROXY, HEAP, JAVACODE, MYFUEL, NAME, NEWINSTANCE, NULL, PARENT, PARSEPRIORITY, PREV, TESTER, THIS |
Constructor Summary | |
---|---|
MostCommonSubstrings()
|
Method Summary | |
---|---|
java.lang.String |
description()
a short description of this CS, shorter than the javadoc, but long enough to tell what the params are for. |
double |
DForProxy()
Execute this CS and cast to double. |
int |
maxP()
Maximum quantity of Params |
int |
minP()
For DForProxy(). Minimum number of parameters in param[] needed to call DForProxy(). Defines which indexs of param[] DForProxy() can use. Functions with a different number of parameters must override this. OVERRIDE THIS FUNCTION IF EXEC USES A DIFFERENT NUMBER OF PARAMETERS. Default is 1. |
Methods inherited from class codesimian.DefaultCS |
---|
B, C, countP, decrementMyFuel, deleteP, F, fuel, getExec, getObject, heap, I, indexP, indexPName, insertB, insertC, insertD, insertF, insertI, insertJ, insertL, insertL, insertL1, insertP, insertS, insertZ, J, javaCode, keyword, LForProxy, LForProxy, myFuel, name, newInstance, objectToCS, objectToCSArray, objectToCSArray, P, prevD, prevL, PType, S, setB, setC, setCountP, setD, setD, setExec, setF, setFuel, setI, setJ, setL, setL, setL, setL1, setMyFuel, setName, setObject, setP, setPrevExec, setPType, setS, setZ, start, toString, V, Z |
Methods inherited from class codesimian.CS |
---|
addB, addC, addD, addF, addI, addJ, addL, addP, addP, addP, addP, addP, addS, addZ, BForProxy, CForProxy, clone, cost, D, deleteP, FForProxy, GETB, GETC, GETD, GETF, GETI, GETJ, GETL, GETS, GETZ, IForProxy, isIllusion, JForProxy, L, L, L, L, L, maxD, minD, overwrites, parent, parsePriority, PB, PC, PD, PF, PI, PJ, PL, prevB, prevC, prevF, prevI, prevJ, prevS, prevZ, proxyOf, PS, PZ, reflect, reflect, reflect6, setB, SETB, setC, SETC, setCost, SETD, setDescription, setF, SETF, setHeap, setI, SETI, setJ, SETJ, SETL, setL, setL, setParent, setParsePriority, setProxyOf, setS, SETS, setTester, setZ, SETZ, SForProxy, tester, toJavaCode, VForProxy, voidReflect, ZForProxy |
Methods inherited from class java.lang.Object |
---|
equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final double defaultChangeSpeedFraction
public static final double defaultMaxListSize
Constructor Detail |
---|
public MostCommonSubstrings()
Method Detail |
---|
public double DForProxy()
CS
DForProxy
in class DefaultCS
public int minP()
DefaultCS
minP
in class DefaultCS
public int maxP()
CS
maxP
in class CS
public java.lang.String description()
CS
description
in class DefaultCS
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |