Wednesday, 18 May 2016

What are the most commonly used input formats in Hadoop?


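1. Text Input Format (TextInputFormat) : the default input format. Key = byte offset of the line within the file, Value = contents of the line.

2. Key Value Input Format (KeyValueTextInputFormat) : for plain text files where each line is split into a key and a value at the first separator character (a tab by default).

3. Sequence File Input Format (SequenceFileInputFormat) : used for reading sequence files, Hadoop's binary key/value file format.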
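
To make the difference between the first two formats concrete, here is a minimal sketch in plain Java that imitates how each one turns lines of text into records. It does not use the Hadoop API: the class InputFormatSketch and its methods textRecords and keyValueRecords are hypothetical names made up for this illustration, and the sketch assumes UTF-8 text with "\n" line endings and a tab separator.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Illustration only: a plain-Java imitation of the records produced by
// TextInputFormat and KeyValueTextInputFormat. Not Hadoop code.
public class InputFormatSketch {

    // Like TextInputFormat: key = byte offset where the line starts,
    // value = the line itself (without the line terminator).
    static List<String[]> textRecords(String data) {
        List<String[]> records = new ArrayList<>();
        int start = 0;
        long offset = 0;
        while (start < data.length()) {
            int end = data.indexOf('\n', start);
            if (end < 0) {
                end = data.length(); // last line has no trailing newline
            }
            String line = data.substring(start, end);
            records.add(new String[] { Long.toString(offset), line });
            // Advance by the line's byte length plus one byte for '\n'.
            offset += line.getBytes(StandardCharsets.UTF_8).length + 1;
            start = end + 1;
        }
        return records;
    }

    // Like KeyValueTextInputFormat: split each line at the first tab.
    // If a line has no tab, the whole line is the key and the value is empty.
    static List<String[]> keyValueRecords(String data) {
        List<String[]> records = new ArrayList<>();
        for (String[] rec : textRecords(data)) {
            String line = rec[1];
            int tab = line.indexOf('\t');
            if (tab < 0) {
                records.add(new String[] { line, "" });
            } else {
                records.add(new String[] { line.substring(0, tab), line.substring(tab + 1) });
            }
        }
        return records;
    }

    public static void main(String[] args) {
        String data = "apple\tred\nbanana\tyellow\n";
        for (String[] r : textRecords(data)) {
            System.out.println("text      key=" + r[0] + " value=" + r[1]);
        }
        for (String[] r : keyValueRecords(data)) {
            System.out.println("key-value key=" + r[0] + " value=" + r[1]);
        }
    }
}
```

For the two sample lines, the text-style records are keyed by the offsets 0 and 10, while the key-value-style records are keyed by "apple" and "banana". In a real job the format is chosen on the Job object, and the keys and values arrive as Writable types rather than strings.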
