@@ -147,10 +147,10 @@ This document uses convenient shorthand for standard types:
147147* ` R4 ` , ` R8 ` : single and double precision floating-point
148148
149149* ` I1 ` , ` I2 ` , ` I4 ` , ` I8 ` : signed integer types with the indicated number of
150- bytes
150+ bytes
151151
152152* ` U1 ` , ` U2 ` , ` U4 ` , ` U8 ` : unsigned integer types with the indicated number of
153- bytes
153+ bytes
154154
155155* ` UG ` : unsigned type with 16-bytes, typically used as a unique ID
156156
@@ -161,10 +161,10 @@ bytes
161161* ` DZ ` : datetime zone, a date and time with a timezone
162162
163163* ` U4[100-199] ` : A key type based on ` U4 ` representing legal values from 100
164- to 199, inclusive
164+ to 199, inclusive
165165
166166* ` V<R4,3,2> ` : A vector type with item type ` R4 ` and dimensionality
167- information [ 3,2]
167+ information [ 3,2]
168168
169169See the sections on the specific types for more detail.
170170
@@ -233,18 +233,18 @@ type, which is a compatible column type.
233233
234234For example:
235235
236- * A column may have a ` BL ` valued piece of metadata associated with the string
237- ` IsNormalized ` indicating whether the column can be interpreted as a label .
236+ * A column may indicate that it is normalized, by providing a ` BL ` valued
237+ piece of metadata named ` IsNormalized ` .
238238
239239* A column whose type is ` V<R4,17> ` , meaning a vector of length 17 whose items
240- are single-precision floating-point values, might have ` SlotNames ` metadata of
241- type ` V<TX,17> ` , meaning a vector of length 17 whose items are text.
240+ are single-precision floating-point values, might have ` SlotNames ` metadata
241+ of type ` V<TX,17> ` , meaning a vector of length 17 whose items are text.
242242
243243* A column produced by a scorer may have several pieces of associated
244- metadata, indicating the "scoring column group id" that it belongs to, what
245- kind of scorer produced the column (e.g., binary classification), and the
246- precise semantics of the column (e.g., predicted label, raw score,
247- probability).
244+ metadata, indicating the "scoring column group id" that it belongs to, what
245+ kind of scorer produced the column (e.g., binary classification), and the
246+ precise semantics of the column (e.g., predicted label, raw score,
247+ probability).
248248
249249The ` ISchema ` interface, including the metadata API, is fully specified in
250250another document.
@@ -401,7 +401,7 @@ Notes:
401401 representation values are from one up to and including ` Count ` . The ` Count `
402402 is required to be representable in the underlying type, so, for example, the
403403 ` Count ` value of a key type based on ` System.Byte ` must not exceed ` 255 ` . As
404- an example of the usefulness of the ` Count ` property, consider the
404+ an example of the usefulness of the ` Count ` property, consider the
405405 ` KeyToVector ` transform implemented as part of ML.NET. It maps from a key
406406 type value to an indicator vector. The length of the vector is the ` Count `
407407 of the key type, which is required to be positive. For a key value of ` k ` ,
@@ -416,7 +416,7 @@ Notes:
416416
417417* The ` Min ` property returns the minimum semantic value of the key type. This
418418 is used exclusively for transforming from a representation value, where the
419- valid values start at one, to user facing values, which might start at any
419+ valid values start at one, to user facing values, which might start at any
420420 non-negative value. The most common values for ` Min ` are zero and one.
421421
422422* The boolean ` Contiguous ` property indicates whether values of the key type
@@ -428,13 +428,13 @@ Notes:
428428
429429* A key type can be non-` Contiguous ` only if ` Count ` is zero. The converse
430430 however is not true. A key type that is contiguous but has ` Count ` equal to
431- zero is one where there is a reasonably small maximum, but that maximum is
431+ zero is one where there is a reasonably small maximum, but that maximum is
432432 unknown. In this case, an array might be a good choice for a map from the
433433 key type.
434434
435435* The shorthand for a key type with representation type ` U1 ` , and semantic
436436 values from ` 1000 ` to ` 1099 ` , inclusive, is ` U1[1000-1099] ` . Note that the
437- ` Min ` value of this key type is outside the range of the underlying type,
437+ ` Min ` value of this key type is outside the range of the underlying type,
438438 ` System.Byte ` , but the ` Count ` value is only ` 100 ` , which is representable
439439 in a ` System.Byte ` . Recall that the representation values always start at 1
440440 and extend up to ` Count ` , in this case ` 100 ` .
@@ -454,7 +454,7 @@ There are standard conversions from one key type to another, provided:
454454
455455* Either the number of bytes in the destination's underlying type is greater
456456 than the number of bytes in the source's underlying type, or the ` Count `
457- value is positive. In the latter case, the ` Count ` is necessarily less than
457+ value is positive. In the latter case, the ` Count ` is necessarily less than
458458 2k, where k is the number of bits in the destination type's underlying type.
459459 For example, ` U1[1-*] ` can be converted to ` U2[1-*] ` , but ` U2[1-*] ` cannot
460460 be converted to ` U1[1-*] ` . Also, ` U1[1-100] ` and ` U2[1-100] ` can be
@@ -502,17 +502,17 @@ partitioned into an unknown number of runs of consecutive slots each of length
502502` 64 ` .
503503
504504As another example, consider an image data set. The data starts with a ` TX `
505- column containing URLs for images. Applying a BitmapLoader transform generates
506- a column of a custom (non-standard) type, ` Picture<*,*,4> ` , where the
507- asterisks indicate that the picture dimensions are unknown. The last dimension
508- of ` 4 ` indicates that there are four channels in each pixel: the three color
509- components, plus the alpha channel. Applying a ` BitmapScaler ` transform scales
510- and crops the images to a specified size, for example, ` 100x100 ` , producing a
511- type of ` Picture<100,100,4> ` . Finally, applying a ` PixelExtractor ` transform
512- (and specifying that the alpha channel should be dropped), produces the vector
513- type ` V<R4,3,100,100> ` . In this example, the ` PixelExtractor ` re-organized the
514- color information into separate planes, and divided each pixel value by 256 to
515- get pixel values between zero and one.
505+ column containing URLs for images. Applying an ` ImageLoader ` transform
506+ generates a column of a custom (non-standard) type, ` Picture<*,*,4> ` , where
507+ the asterisks indicate that the picture dimensions are unknown. The last
508+ dimension of ` 4 ` indicates that there are four channels in each pixel: the
509+ three color components, plus the alpha channel. Applying an ` ImageResizer `
510+ transform scales and crops the images to a specified size, for example,
511+ ` 100x100 ` , producing a type of ` Picture<100,100,4> ` . Finally, applying a
512+ ` ImagePixelExtractor ` transform (and specifying that the alpha channel should
513+ be dropped), produces the vector type ` V<R4,3,100,100> ` . In this example, the
514+ ` ImagePixelExtractor ` re-organized the color information into separate planes,
515+ and divided each pixel value by 256 to get pixel values between zero and one.
516516
517517### Equivalence
518518
@@ -556,14 +556,14 @@ Notes:
556556
557557* The ` Indices ` array is only relevant when the vector is sparse. In the
558558 sparse case, ` Indices ` is parallel to ` Values ` , only the first ` Count ` items
559- are meaningful, the indices must be non-negative and less than ` Length ` ,
560- and the indices must be strictly increasing. Note that when ` Count ` is zero,
559+ are meaningful, the indices must be non-negative and less than ` Length ` , and
560+ the indices must be strictly increasing. Note that when ` Count ` is zero,
561561 ` Indices ` may be null. In the dense case, ` Indices ` is not meaningful and
562562 may or may not be null.
563563
564564* It is very common for the arrays in a ` VBuffer<T> ` to be larger than needed
565565 for their current value. A special case of this is when a dense ` VBuffer<T> `
566- has a non-null ` Indices ` array. The extra items in the arrays are not
566+ has a non-null ` Indices ` array. The extra items in the arrays are not
567567 meaningful and should be ignored. Allowing these buffers to be larger than
568568 currently needed reduces the need to reallocate buffers for different
569569 values. For example, when cursoring through a vector valued column with
@@ -574,7 +574,7 @@ Notes:
574574
575575* Generally, vectors should use a sparse representation only when the number
576576 of non-default items is at most half the value of Length. However, this
577- guideline is not a mandate.
577+ guideline is not a mandate.
578578
579579See the full ` IDataView ` technical specification for additional details on
580580` VBuffer<T> ` , including complete discussion of programming idioms, and
@@ -668,7 +668,7 @@ There are standard conversions from one key type to another, provided:
668668
669669* Either the number of bytes in the destination's underlying type is greater
670670 than the number of bytes in the source's underlying type, or the ` Count `
671- value is positive. In the latter case, the ` Count ` is necessarily less than
671+ value is positive. In the latter case, the ` Count ` is necessarily less than
672672 ` 2^^k ` , where ` k ` is the number of bits in the destination type's underlying
673673 type. For example, ` U1[1-*] ` can be converted to ` U2[1-*] ` , but ` U2[1-*] `
674674 cannot be converted to ` U1[1-*] ` . Also, ` U1[1-100] ` and ` U2[1-100] ` can be
@@ -709,7 +709,7 @@ In the following notes, the symbol `type` is a variable of type `ColumnType`.
709709
710710* Certain .Net types have a corresponding ` DataKind ` ` enum ` value. The value
711711 of the ` type.RawKind ` property is consistent with ` type.RawType ` . For .Net
712- types that do not have a corresponding ` DataKind ` value, the ` type.RawKind `
712+ types that do not have a corresponding ` DataKind ` value, the ` type.RawKind `
713713 property returns zero. The ` type.RawKind ` property is particularly useful
714714 when switching over raw type possibilities, but only after testing for the
715715 broader kind of the type (key type, numeric type, etc.).
@@ -730,22 +730,22 @@ In the following notes, the symbol `type` is a variable of type `ColumnType`.
730730
731731* If ` type ` is a key type, then ` type.KeyCount ` is the same as
732732 ` ((KeyType)type).Count ` . If ` type ` is not a key type, then ` type.KeyCount `
733- is zero. Note that a key type can have a ` Count ` value of zero, indicating
733+ is zero. Note that a key type can have a ` Count ` value of zero, indicating
734734 that the count is unknown, so ` type.KeyCount ` being zero does not imply that
735735 ` type ` is not a key type. In summary, ` type.KeyCount ` is equivalent to:
736736 ` type is KeyType ? ((KeyType)type).Count : 0 ` .
737737
738738* The ` type.ItemType ` property is the item type of the vector type, if ` type `
739739 is a vector type, and is the same as ` type ` otherwise. For example, to test
740- for a type that is either ` TX ` or a vector of ` TX ` , one can use
740+ for a type that is either ` TX ` or a vector of ` TX ` , one can use
741741 ` type.ItemType.IsText ` .
742742
743743* The ` type.IsKnownSizeVector ` property is equivalent to `type.VectorSize >
744744 0`.
745745
746746* The ` type.VectorSize ` property is zero if either ` type ` is not a vector type
747747 or if ` type ` is a vector type of unknown/variable length. Otherwise, it is
748- the length of vectors belonging to the type.
748+ the length of vectors belonging to the type.
749749
750750* The ` type.ValueCount ` property is one if ` type ` is not a vector type and the
751751 same as ` type.VectorSize ` if ` type ` is a vector type.
@@ -756,7 +756,7 @@ In the following notes, the symbol `type` is a variable of type `ColumnType`.
756756
757757* The ` SameSizeAndItemType ` method is the same as ` Equals ` for non-vector
758758 types. For vector types, it returns true iff the two types have the same
759- item type and have the same ` VectorSize ` values. For example, for the two
759+ item type and have the same ` VectorSize ` values. For example, for the two
760760 vector types ` V<R4,3,2> ` and ` V<R4,6> ` , ` Equals ` returns false but
761761 ` SameSizeAndItemType ` returns true.
762762
0 commit comments