Column-Mapping.md

---
layout: page
title: Column Mapping
---

Column mapping allows you to configure how your records should be stored in HBase for maximum performance and efficiency. You define the column mapping in JSON format in a data-centric way, and Kite stores and retrieves the data correctly.

A column mapping is a JSON list of definitions that specify how to store each field in the record. Each definition is a JSON object with a `source`, a `type`, and any additional properties required by the type. The `source` property specifies which field in the source record the definition applies to. The `type` property controls where the source field's data is stored.
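For illustration, a mapping for a hypothetical user record might look like the following sketch (the field names, column family, and qualifiers are assumptions, not values from the Kite reference):

```json
[
  {"source": "id", "type": "key"},
  {"source": "username", "type": "column", "family": "u", "qualifier": "username"},
  {"source": "visits", "type": "counter", "family": "u", "qualifier": "visits"},
  {"source": "prefs", "type": "keyAsColumn", "family": "p"}
]
```

In this sketch, `key` stores the field in the HBase row key, while `column`, `counter`, and `keyAsColumn` store values in cells under the given column family.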
HBase-Storage-Cells.md

---
layout: page
title: HBase Storage Cells
---

HBase stores data as a group of values, or cells, and uniquely identifies each cell by a key. Using a key, you can look up a record stored in HBase very quickly. You can also insert, modify, or delete records in the middle of a dataset. HBase makes this possible by organizing data by storage key.
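As a rough sketch of what that means in practice, here is how a single cell is addressed with the plain HBase Java client (the table name, row key, column family, and qualifier below are made-up examples):

```java
import java.io.IOException;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

public class CellLookup {
  // Looks up one record directly by its storage key and reads a single cell.
  static byte[] readEmail(Connection connection) throws IOException {
    try (Table table = connection.getTable(TableName.valueOf("users"))) {
      Get get = new Get(Bytes.toBytes("user-1234"));   // row (storage) key
      Result result = table.get(get);
      return result.getValue(Bytes.toBytes("u"),        // column family
                             Bytes.toBytes("email"));   // column qualifier
    }
  }
}
```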
Inferring-a-Schema-from-an-Avro-Data-File.md

---
layout: page
title: Inferring a Schema from an Avro Data File
---

You can use the `DatasetDescriptor.Builder.schemaFromAvroDataFile` method to reuse the schema of an existing data file in Avro format. The source can be a local file, an `InputStream`, or a URI.
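A minimal sketch with a local file (the file name is an assumption):

```java
import java.io.File;
import java.io.IOException;
import org.kitesdk.data.DatasetDescriptor;

public class InferFromAvroFile {
  static DatasetDescriptor describe() throws IOException {
    // Reuse the schema embedded in an existing Avro data file.
    return new DatasetDescriptor.Builder()
        .schemaFromAvroDataFile(new File("movies.avro"))
        .build();
  }
}
```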
Kite-Data-Module-Overview.md

---
layout: page
title: Kite Data Module Overview
---

The Kite Data module is a set of APIs for interacting with data in Hadoop; specifically, direct reading and writing of datasets in storage subsystems such as the Hadoop Distributed File System (HDFS).

These APIs do not replace or supersede any of the existing Hadoop APIs. Instead, the Data module streamlines application of those APIs. You still use HDFS and Avro APIs directly when necessary. The Kite Data module reflects best practices for default choices, data organization, and metadata system integration.

The Data module contains APIs and utilities for defining and performing actions on:

* <a href="#entities">entities</a>
* <a href="#schemas">schemas</a>
* <a href="#datasets">datasets</a>
* <a href="#repositories">dataset repositories</a>
* <a href="#loading">loading data</a>
* <a href="#viewing">viewing data</a>

Many of these objects are interfaces, permitting multiple implementations. While, in theory, any implementation of Hadoop's `FileSystem` abstract class is supported by the Kite Data module, only the local and HDFS filesystem implementations are tested and officially supported.

## Entities

An entity is a single record in a dataset. The name _entity_ is a better term than _record_, because _record_ sounds as if it is a simple list of primitives, while _entity_ sounds more like a Plain Old Java Object you would find in a JPA class (see [JPA Entity](https://en.wikipedia.org/wiki/Java_Persistence_API#Entities) on Wikipedia). That said, _entity_ and _record_ are often used interchangeably when talking about datasets.

Entities can be simple types, representing data structures with a few string attributes.

Best practice is to define the output for your system, identifying all of the field values required to produce the report or analytics results you need. Once you identify your required fields, you define one or more related entities in which to store the information you need to create your output. Define the format and structure of your entities using a schema.

## Schemas

A schema defines the field names and datatypes for a dataset. Kite relies on an Apache Avro schema definition for each dataset. For example, a schema for a table listing movies from the `movies.csv` dataset might look like the following.
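This is only a sketch; the field names here are assumptions rather than the exact fields of the example dataset:

```json
{
  "type": "record",
  "name": "Movie",
  "fields": [
    {"name": "id", "type": "int"},
    {"name": "title", "type": "string"},
    {"name": "release_date", "type": "string"}
  ]
}
```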
The goal is to get the schema into `.avsc` format and store it in the Hadoop filesystem.

## Datasets

A dataset is a collection of zero or more entities, represented by the interface `Dataset`. The relational database analog of a dataset is a table.

The HDFS implementation of a dataset is stored as Snappy-compressed Avro data files by default and is made up of zero or more files in a directory. You also have the option of storing your dataset in the column-oriented Parquet file format.
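For instance, a dataset descriptor that opts into Parquet might be built along these lines (a sketch; the schema file name is an assumption):

```java
import java.io.File;
import java.io.IOException;
import org.kitesdk.data.DatasetDescriptor;
import org.kitesdk.data.Formats;

public class ParquetDescriptor {
  static DatasetDescriptor describe() throws IOException {
    return new DatasetDescriptor.Builder()
        .schema(new File("movie.avsc"))   // Avro schema definition
        .format(Formats.PARQUET)          // store entities as Parquet instead of Avro
        .build();
  }
}
```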
You can work with a subset of dataset entities using the Views API.

<a name="repositories" />

## Dataset Repositories

A _dataset repository_ is a physical storage location for datasets. Keeping with the relational database analogy, a dataset repository is the equivalent of a database of tables.

Each dataset belongs to exactly one dataset repository.

<a name="loading" />

## Loading Data from CSV

You can load comma-separated value (CSV) data into a dataset repository using the command-line interface command [csv-import](../Kite-Dataset-Command-Line-Interface/index.html#csvImport).
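For example, assuming the CLI executable is saved as `dataset` and a dataset named `movies` already exists, the import looks roughly like this:

```bash
# Import rows from a local CSV file into the existing "movies" dataset.
./dataset csv-import movies.csv movies
```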
<a name="viewing" />

## Viewing Your Data

Once created, datasets you create with Kite are no different from any other Hadoop datasets in your system. You can query the data with Hive or view it using Impala.
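For example, a dataset created in the Hive repository shows up as a table you can query directly (the dataset name here is an assumption):

```bash
impala-shell -q 'SELECT COUNT(*) FROM movies'
```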
Kite-Dataset-Command-Line-Interface.md

---
layout: page
title: Kite Dataset Command Line Interface
---

The Kite Dataset command-line interface (CLI) provides utility commands that let you perform essential tasks such as creating a schema and dataset, importing data from a CSV file, and viewing the results.
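Assuming the CLI jar has been saved as an executable named `dataset`, you can list the available commands with:

```bash
./dataset help
```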
Kite-SDK-Guide.md

---
layout: page
title: What is Kite?
---

## (and What Makes It So Awesome?)

You can learn why Kite is awesome by watching this <a href="http://www.youtube.com/watch?feature=player_embedded&v=JXAm3aasI6c">Kite Overview video</a>.

Things should just work together.

Hadoop is not difficult to use. The complexity comes from the many parts that make up a very large animal. Each piece, in isolation, is straightforward and easy to understand.

## Enter Kite

This is where Kite comes in. Kite provides additional support for this infrastructure one level up in the stack, codifying it in APIs that make sense to developers.
Partition-Strategy-Format.md

---
layout: page
title: Partition Strategy JSON Format
---

A partition strategy is made up of a list of partition fields. Each field defines how to take source data from an entity and produce a value that is used to store the entity. For example, one field can produce the year an event happened from its timestamp, and another field in the strategy can produce the month from the same timestamp.

A field definition can optionally provide a `name` attribute, which is used to refer to the field.

Requirements for the source data are validated when schemas and partition strategies are used together.

## Examples

This strategy uses the year, month, and day from the "received_at" timestamp field on an event.
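Written as JSON, such a strategy looks roughly like this (a sketch, not copied verbatim from the Kite reference):

```json
[
  {"type": "year", "source": "received_at"},
  {"type": "month", "source": "received_at"},
  {"type": "day", "source": "received_at"}
]
```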
This strategy hashes and embeds the "email" field from a user record.
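Roughly, that combines a `hash` field with an `identity` copy of the same source (again a sketch; the bucket count is an arbitrary choice):

```json
[
  {"type": "hash", "source": "email", "buckets": 16},
  {"type": "identity", "source": "email"}
]
```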
### Notes:

1. Source timestamps must be [long][avro-types] fields. The value encodes the number of milliseconds since the Unix epoch, as in Joda Time's [Instant][timestamp] and Java's `Date`.
2. The `buckets` attribute is required for `hash` partitions and controls the number of partitions into which the entities are pseudo-randomly distributed.
Schema-URL-Warning.md

---
layout: page
title: Schema URL Warning
---

This page explains the schema URL warning:

```bash
> The Dataset is using a schema literal rather than a URL which will be attached to every message.
```

This warning means that the dataset is configured using an Avro schema string, a schema object, or by reflection. Configuring with an HDFS URL where the schema can be found, instead of the other options, allows certain components to pass the schema URL rather than the schema's string literal. This cuts down on the size of headers that must be sent with each message.

## Fixing the problem

The following Java code demonstrates how to change the descriptor to use a schema URL instead of a schema literal:
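A minimal sketch of the idea (the HDFS schema path is an assumption), building the descriptor with a schema URI rather than an embedded schema:

```java
import java.io.IOException;
import java.net.URI;
import org.kitesdk.data.DatasetDescriptor;

public class UseSchemaUri {
  static DatasetDescriptor describe() throws IOException {
    // Reference the schema by URL; components can then pass the URL
    // instead of attaching the full schema literal to every message.
    return new DatasetDescriptor.Builder()
        .schemaUri(URI.create("hdfs:/user/examples/schemas/event.avsc"))
        .build();
  }
}
```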
Kite provides a set of tools that handle the basic legwork of creating a dataset, allowing you to focus on the specifics of the business problem you want to solve. This short tutorial walks you through the process of creating a dataset and viewing the results using the command-line interface (CLI).
## Preparation

If you have not done so already, download the Kite command-line interface jar. This jar is the executable that runs the command-line interface, so save it as `dataset`. You can download it with curl.
If you have a CSV file sitting around waiting to be used, you can substitute your file for the one that follows. The truth is, it doesn't matter whether you have 100 columns or 2; the process is the same. Larger datasets are only larger, not more complex.
The tail of the sample *sandwiches.csv* file looks like this:

```
Reuben, Pastrami and sauerkraut on toasted rye with Russian dressing.
PBJ, Peanut butter and grape jelly on white bread.
```
## Infer the Schema

All right. Now we get to use the CLI. Start by inferring an Avro schema file from the *sandwiches.csv* file you just created. Enter the following command to create an Avro schema file named *sandwich.avsc* with the class name *Sandwich*. The schema details are based on the headings and data in *sandwiches.csv*.
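The command looks roughly like this (assuming the CLI executable was saved as `dataset` in the current directory):

```bash
./dataset csv-schema sandwiches.csv --class Sandwich -o sandwich.avsc
```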
If you open *sandwich.avsc* in a text editor, it looks something like the code below.
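Roughly speaking, and assuming the CSV has `name` and `description` columns, the inferred schema has this shape (the real output includes more detail, such as nullable field types):

```json
{
  "type": "record",
  "name": "Sandwich",
  "fields": [
    {"name": "name", "type": "string"},
    {"name": "description", "type": "string"}
  ]
}
```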
## Create the Dataset

With a schema, you can create a new dataset. Enter the following command.
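A sketch of the command, using the schema file from the previous step (again assuming the CLI executable is `./dataset`):

```bash
./dataset create sandwiches --schema sandwich.avsc
```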
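To check the result, you can ask the CLI to print the dataset's schema back to you (a sketch of the `schema` subcommand):

```bash
./dataset schema sandwiches
```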
You'll get the same schema back, but this time, trust me, it's coming from the Hive repository.
## Import the CSV Data

You've created a dataset in the Hive repository, which is the container, but not the information itself. Next, you might want to add some data so that you can run some queries. Use the following command to import the sandwiches in your CSV file.
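A sketch of the import, matching the names used above:

```bash
./dataset csv-import sandwiches.csv sandwiches
```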