- All Implemented Interfaces:
- InputFormat<Record,FileInputSplit>, Serializable
public class PointInFormat
extends DelimitedInputFormat
Generates records with an id and a and CoordVector.
The input format is line-based, i.e. one record is read from one line
which is terminated by '\n'. Within a line the first '|' character separates
the id from the the CoordVector. The vector consists of a vector of decimals.
The decimals are separated by '|' as well. The id is the id of a data point or
cluster center and the CoordVector the corresponding position (coordinate
vector) of the data point or cluster center. Example line:
"42|23.23|52.57|74.43| Id: 42 Coordinate vector: (23.23, 52.57, 74.43)
- See Also:
- Serialized Form