Sorry if this is a novice question, but I have only just started exploring RapidMiner. I have used it to cluster sample data [using k-means clustering]. My question: if I use an Excel file of raw data as the input to the clustering, how do I get the Excel data [output data] split into k clusters in an Excel file? I already know how to create the clustering and how to switch between the Design and Results screens.
Thanks in advance.
Hi, and welcome to Stack Overflow and to RapidMiner.
If I understand your question correctly, you read data from Excel, run a clustering, and then want to write the individual clusters back to Excel.
If you want to do this manually, you can use the "Filter Examples" operator and filter for a specific cluster. You can also do it automatically with the "Loop Values" operator: set the loop attribute to cluster and use the iteration macro inside the loop to filter the data. Then store the data, using the iteration macro in the file name as well.
See the sample process below (you can copy and paste it directly into the XML panel in RapidMiner):
<?xml version="1.0" encoding="utf-8" standalone="no"?>
<process version="7.0.0">
  <context>
    <input/>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" compatibility="7.1.000-snapshot" expanded="true" name="process">
    <process expanded="true">
      <operator activated="true" class="generate_data" compatibility="7.1.000-snapshot" expanded="true" height="68" name="generate data" width="90" x="112" y="34"/>
      <operator activated="true" class="generate_id" compatibility="7.1.000-snapshot" expanded="true" height="82" name="generate id" width="90" x="246" y="34"/>
      <operator activated="true" class="k_means" compatibility="7.1.000-snapshot" expanded="true" height="82" name="clustering" width="90" x="447" y="34">
        <parameter key="k" value="5"/>
      </operator>
      <operator activated="true" class="loop_values" compatibility="7.1.000-snapshot" expanded="true" height="82" name="loop values" width="90" x="715" y="34">
        <parameter key="attribute" value="cluster"/>
        <process expanded="true">
          <operator activated="true" breakpoints="after" class="filter_examples" compatibility="7.1.000-snapshot" expanded="true" height="103" name="filter examples" width="90" x="179" y="34">
            <list key="filters_list">
              <parameter key="filters_entry_key" value="cluster.equals.%{loop_value}"/>
            </list>
          </operator>
          <connect from_port="example set" to_op="filter examples" to_port="example set input"/>
          <connect from_op="filter examples" from_port="example set output" to_port="out 1"/>
          <portspacing port="source_example set" spacing="0"/>
          <portspacing port="sink_out 1" spacing="0"/>
          <portspacing port="sink_out 2" spacing="0"/>
        </process>
      </operator>
      <connect from_op="generate data" from_port="output" to_op="generate id" to_port="example set input"/>
      <connect from_op="generate id" from_port="example set output" to_op="clustering" to_port="example set"/>
      <connect from_op="clustering" from_port="clustered set" to_op="loop values" to_port="example set"/>
      <connect from_op="loop values" from_port="out 1" to_port="result 1"/>
      <portspacing port="source_input 1" spacing="0"/>
      <portspacing port="sink_result 1" spacing="0"/>
      <portspacing port="sink_result 2" spacing="0"/>
    </process>
  </operator>
</process>
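If it helps to see the idea outside RapidMiner, here is a rough Python sketch of what the loop does: take each distinct value of the cluster attribute, keep only the rows with that value, and store them in a file named after the value. This is only an illustration, not RapidMiner code. The function name `split_by_cluster`, the `cluster` column name, and the example rows are assumptions, and it writes CSV files rather than Excel so that it runs with the standard library alone.

```python
import csv
from collections import defaultdict
from pathlib import Path


def split_by_cluster(rows, out_dir, cluster_key="cluster"):
    """Write one CSV per distinct cluster value; return {value: path}.

    Hypothetical helper: `rows` is a list of dicts that all share the same
    keys, and `cluster_key` names the column holding the cluster label.
    """
    # Group the rows by cluster value (the role of Loop Values + Filter Examples).
    groups = defaultdict(list)
    for row in rows:
        groups[row[cluster_key]].append(row)

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    written = {}
    for value, members in groups.items():
        # The cluster value doubles as the file name, like the iteration macro.
        path = out_dir / f"{value}.csv"
        with path.open("w", newline="", encoding="utf-8") as handle:
            writer = csv.DictWriter(handle, fieldnames=list(members[0].keys()))
            writer.writeheader()
            writer.writerows(members)
        written[value] = path
    return written


if __name__ == "__main__":
    # Made-up example rows, standing in for a clustered example set.
    example_rows = [
        {"id": "1", "att1": "0.5", "cluster": "cluster_0"},
        {"id": "2", "att1": "1.5", "cluster": "cluster_1"},
        {"id": "3", "att1": "0.7", "cluster": "cluster_0"},
    ]
    for value, path in split_by_cluster(example_rows, "clusters_out").items():
        print(value, "->", path)
```

Inside RapidMiner the same effect comes from adding a storing step inside the loop and putting the iteration macro into its file name, as described above.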