Andre/Mike Brain Fart todo

From SnOwy - Ed's Wiki Notebook

Jump to: navigation, search

Log

A log of work completed on this project. Notice that Andre is working on the Kmer counting, Mike is doing the sequencing and sampling, and I've been charged with the G?SOM.

20100430 To Do

Attached is a runnable JAR and the Eclipse project source. If you want to edit
the source, you'll need an additional library...but that's straight forward.

Anyway, to run java -jar brainfart.jar $a $k
$a = alphabet: nt for nucleotide or p for protein
$k = length of kmer.
--
--Andre Masella (andre@masella.no-ip.org, masella.no-ip.org, PGP:10ECBAE0)

An engineer is someone who does list processing in FORTRAN.
2 attachments — Download all attachments  
source.tar.gz	source.tar.gz
4K   Download  
brainfart.jar	brainfart.jar
17K   Download   

20100426 To Do

kmer and ksize in mothur

Conclusion re: source of inefficiency

Challenges and Recommendations

20100426 Remarks

Training SOM phases

Initialization

Ordering, Fine-Tuning

w(t+1) = w(t) + α*h*[x(k) - w(t)];
let α be the learning rate
let h be the neighbourhood kernel function
let x at k be the kth input vector
let x(k) and w(t) have the same dimensions -- equal to the number of components in the input vector

GSOM Modifications

GT = -D * ln(SF)
let 'D' be the dimensionality of the data.

Ewinner(t + 1) = Ewinner(t) + ||x(k) - wwinner(t)||

Questions for Andre

Andre (Poki)
GTalk
0426.21:47
I will strip gaps. I strip whitespace, I'll make "-" == " "
0426.21:47
Sliding window.

20100425 Remarks

Competitive Learning incl. G?SOM

Mothur

20100423 To Do

Briefings

Having discussed the project with Mike and Andre more extensively, it became a bit clear that we have the opportunity to introduce new clustering techniques to the taxonomy culture.

Taxonomy Culture

What we will be offering

Kmer counts, SOM

Other considerations - Naïve Bayesian classifier

Additional history - GSOM Paper

Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox