Rapidminer - k-means query -


sorry if novice question ask have started exploring rapidminer. have used cluster sample data [using k-means clustering]. query if use excel raw data file cluster, how excel data [output data] split k clusters in excel file. know how create cluster , switch between design , results screens.

thanks in advance.

hi , welcome stackoverflow , rapidminer.

if understand question correctly, read data excel, make clustering , want write single clusters excel.
if want manually can use "filter examples" operator , filter specific cluster. can automatically "loop values" operator, set loop attribute cluster , use iteration macro inside loop filter data. store data , use iteration macro file name.
see sample process below (you can copy , paste in xml panel directly in rapidminer):

    <?xml version="1.0" encoding="utf-8" standalone="no"?> <process version="7.0.0">   <context>     <input/>     <output/>     <macros/>   </context>   <operator activated="true" class="process" compatibility="7.1.000-snapshot" expanded="true" name="process">     <process expanded="true">       <operator activated="true" class="generate_data" compatibility="7.1.000-snapshot" expanded="true" height="68" name="generate data" width="90" x="112" y="34"/>       <operator activated="true" class="generate_id" compatibility="7.1.000-snapshot" expanded="true" height="82" name="generate id" width="90" x="246" y="34"/>       <operator activated="true" class="k_means" compatibility="7.1.000-snapshot" expanded="true" height="82" name="clustering" width="90" x="447" y="34">         <parameter key="k" value="5"/>       </operator>       <operator activated="true" class="loop_values" compatibility="7.1.000-snapshot" expanded="true" height="82" name="loop values" width="90" x="715" y="34">         <parameter key="attribute" value="cluster"/>         <process expanded="true">           <operator activated="true" breakpoints="after" class="filter_examples" compatibility="7.1.000-snapshot" expanded="true" height="103" name="filter examples" width="90" x="179" y="34">             <list key="filters_list">               <parameter key="filters_entry_key" value="cluster.equals.%{loop_value}"/>             </list>           </operator>           <connect from_port="example set" to_op="filter examples" to_port="example set input"/>           <connect from_op="filter examples" from_port="example set output" to_port="out 1"/>           <portspacing port="source_example set" spacing="0"/>           <portspacing port="sink_out 1" spacing="0"/>           <portspacing port="sink_out 2" spacing="0"/>         </process>       </operator>       <connect from_op="generate data" from_port="output" to_op="generate id" to_port="example set input"/>       <connect from_op="generate id" from_port="example set output" to_op="clustering" to_port="example set"/>       <connect from_op="clustering" from_port="clustered set" to_op="loop values" to_port="example set"/>       <connect from_op="loop values" from_port="out 1" to_port="result 1"/>       <portspacing port="source_input 1" spacing="0"/>       <portspacing port="sink_result 1" spacing="0"/>       <portspacing port="sink_result 2" spacing="0"/>     </process>   </operator> </process> 

Comments