Working with data frames
In this section, we look at various features of the F# data frame library (using both
Series and Frame types and modules). Feel free to jump to the section you are interested
in, but note that some sections refer back to values built in "Creating & loading".
Creating frames & loading data
Loading and saving CSV files
The easiest way to get data into data frame is to use a CSV file. The Frame.ReadCsv
function exposes this functionality:
// Assuming 'root' is a directory containing the file
let titanic = Frame.ReadCsv(root + "titanic.csv")
// Read data and set the index column & order rows
let msft =
Frame.ReadCsv(root + "stocks/msft.csv")
|> Frame.indexRowsDate "Date"
|> Frame.sortRowsByKey
// Specify column separator
let air = Frame.ReadCsv(root + "airquality.csv", separators=";")
In the second example, we call indexRowsDate to use the "Date" column as a row index
of the resulting data frame. This is a very common scenario and so Deedle provides an
easier option using a generic overload of the ReadCsv method:
let msftSimpler =
Frame.ReadCsv<DateTime>(root + "stocks/msft.csv", indexCol="Date")
|> Frame.sortRowsByKey
The ReadCsv method has a number of optional arguments that you can use to control
the loading. It supports both CSV files, TSV files and other formats. If the file name
ends with tsv, the Tab is used automatically, but you can set separator explicitly.
The following parameters can be used:
path- Specifies a file name or an web location of the resource.-
indexCol- Specifies the column that should be used as an index in the resulting frame. The type is specified via a type parameter. -
inferTypes- Specifies whether the method should attempt to infer types of columns automatically (set this tofalseif you want to specify schema) -
inferRows- IfinferTypes=true, this parameter specifies the number of rows to use for type inference. The default value is 100. Value 0 means all rows. -
schema- A string that specifies CSV schema. See the documentation for information about the schema format. -
separators- A string that specifies one or more (single character) separators that are used to separate columns in the CSV file. Use for example";"to parse semicolon separated files. -
culture- Specifies the name of the culture that is used when parsing values in the CSV file (such as"en-US"). The default is invariant culture.
Once you have a data frame, you can also save it to a CSV file using the
SaveCsv method. For example:
// Save CSV with semicolon separator
air.SaveCsv(Path.GetTempFileName(), separator=';')
// Save as CSV and include row key as "Date" column
msft.SaveCsv(Path.GetTempFileName(), keyNames=["Date"], separator='\t')
By default, the SaveCsv method does not include the key from the data frame. This can be
overridden by calling SaveCsv with the optional argument includeRowKeys=true, or with an
additional argument keyNames (demonstrated above) which sets the headers for the key column(s)
in the CSV file.
Loading F# records or .NET objects
If you have another .NET or F# components that returns data as a sequence of F# records,
C# anonymous types or other .NET objects, you can use Frame.ofRecords to turn them
into a data frame. Assume we have:
type Person =
{ Name:string; Age:int; Countries:string list; }
let peopleRecds =
[ { Name = "Joe"; Age = 51; Countries = [ "UK"; "US"; "UK"] }
{ Name = "Tomas"; Age = 28; Countries = [ "CZ"; "UK"; "US"; "CZ" ] }
{ Name = "Eve"; Age = 2; Countries = [ "FR" ] }
{ Name = "Suzanne"; Age = 15; Countries = [ "US" ] } ]
Now we can easily create a data frame that contains three columns
(Name, Age and Countries) containing data of the same type as
the properties of Person:
// Turn the list of records into data frame
let peopleList = Frame.ofRecords peopleRecds
// Use the 'Name' column as a key (of type string)
let people = peopleList |> Frame.indexRowsString "Name"
people?Age
people.GetColumn<string list>("Countries")
Expanding objects in columns
For frames that contain complex .NET objects as column values, you can use Frame.expandCols
to create a new frame that contains properties of the object as new columns. For example:
// Create frame with single column 'People'
let peopleNested =
[ "People" => Series.ofValues peopleRecds ] |> frame
// Expand the 'People' column
peopleNested |> Frame.expandCols ["People"]
|
Manipulating data frames
Getting data from a frame
|
To get a column (series) from a frame df, you can use operations that are exposed directly
by the data frame, or you can use df.Columns which returns all columns of the frame as a
series of series.
// Get the 'Age' column as a series of 'float' values
people?Age
// Get the 'Countries' column as a series of 'string list' values
people.GetColumn<string list>("Countries")
// Get all frame columns as a series of series
people.Columns
Adding rows and columns
The series type is immutable and so it is not possible to add new values to a series or
change the values stored in an existing series. However, you can use operations that return
a new series as the result such as Merge.
// Create series with more value
let more = series [ "John" => 48.0 ]
// Create a new, concatenated series
people?Age.Merge(more)
Data frame allows a very limited form of mutation. It is possible to add new series (as a column) to an existing data frame, drop a series or replace a series.
// Calculate age + 1 for all people
let add1 = people?Age |> Series.mapValues ((+) 1.0)
// Add as a new series to the frame
people?AgePlusOne <- add1
// Add new series from a list of values
people?Siblings <- [0; 2; 1; 3]
// Replace existing series with new values
people.ReplaceColumn("Siblings", [3; 2; 1; 0])
// Create new object series with values for required columns
let newRow =
[ "Name" => box "Jim"; "Age" => box 51;
"Countries" => box ["US"]; "Siblings" => box 5 ]
|> series
// Create a new data frame, containing the new series
people.Merge("Jim", newRow)
Advanced slicing and lookup
Given a series, we have a number of options for getting one or more values or observations from the series.
// Get an unordered sample series
let ages = people?Age
// Returns value for a given key
ages.["Tomas"]
// Returns series with two keys from the source
ages.[ ["Tomas"; "Joe"] ]
// Returns 'None' when key is not present
ages |> Series.tryGet "John"
// Returns series with missing value for 'John'
ages |> Series.getAll [ "Tomas"; "John" ]
We can also obtain all data from the series. The data frame library uses the term observations for all key-value pairs:
// Get all observations as a sequence of tuples
ages |> Series.observations
// Get all observations, with 'None' for missing values
ages |> Series.observationsAll
With ordered series, we can use slicing to get a sub-range:
let opens = msft?Open
opens.[DateTime(2013, 1, 1) .. DateTime(2013, 1, 31)]
|> Series.mapKeys (fun k -> k.ToShortDateString())
|
Grouping data
Grouping series
let travels = people.GetColumn<string list>("Countries")
// Group by name length (ignoring visited countries)
travels |> Series.groupBy (fun k v -> k.Length)
// Group by visited countries (people visited/not visited US)
travels |> Series.groupBy (fun k v -> List.exists ((=) "US") v)
// Group by name length and get number of values in each group
travels |> Series.groupInto
(fun k v -> k.Length)
(fun len people -> Series.countKeys people)
travels
|> Series.mapValues (Seq.countBy id >> series)
|> Frame.ofRows
|> Frame.fillMissingWith 0
|
Grouping data frames
// Group using column 'Sex' of type 'string'
titanic |> Frame.groupRowsByString "Sex"
// Group using calculated value - length of name
titanic |> Frame.groupRowsUsing (fun k row ->
row.GetAs<string>("Name").Length)
let bySex = titanic |> Frame.groupRowsByString "Sex"
// Returns series with two frames as values
let bySex1 = bySex |> Frame.nest
// Converts unstacked data back to a single frame
let bySex2 = bySex |> Frame.nest |> Frame.unnest
// Group by passanger class and port
let byClassAndPort =
titanic
|> Frame.groupRowsByInt "Pclass"
|> Frame.groupRowsByString "Embarked"
|> Frame.mapRowKeys Pair.flatten3
// Get average ages in each group
byClassAndPort?Age
|> Stats.levelMean Pair.get1And2Of3
// Averages for all numeric columns
byClassAndPort
|> Frame.getNumericCols
|> Series.dropMissing
|> Series.mapValues (Stats.levelMean Pair.get1And2Of3)
|> Frame.ofColumns
// Count number of survivors in each group
byClassAndPort.GetColumn<bool>("Survived")
|> Series.applyLevel Pair.get1And2Of3 (Series.values >> Seq.countBy id >> series)
|> Frame.ofRows
Summarizing data with pivot table
A pivot table is a useful tool if you want to summarize data in the frame based on two keys that are available in the rows of the data frame.
titanic
|> Frame.pivotTable
// Returns a new row key
(fun k r -> r.GetAs<string>("Sex"))
// Returns a new column key
(fun k r -> r.GetAs<bool>("Survived"))
// Specifies aggregation for sub-frames
Frame.countRows
|
titanic
|> Frame.pivotTable
(fun k r -> r.GetAs<string>("Sex"))
(fun k r -> r.GetAs<bool>("Survived"))
(fun frame -> frame?Age |> Stats.mean)
|> round
|
Hierarchical indexing
Grouping and aggregating
Hierarchical keys are often created as a result of grouping. For example, we can group the rows (representing individual years) by decades:
let decades = msft |> Frame.groupRowsUsing (fun k _ ->
sprintf "%d0s" (k.Year / 10))
// Calculate means per decade for the Close column
decades?Close |> Stats.levelMean fst
// Calculate means per decade for all numeric columns
decades
|> Frame.getNumericCols
|> Series.mapValues (Stats.levelMean fst)
|> Frame.ofColumns
Handling missing values
The support for missing values is built-in, which means that any series or frame can
contain missing values. When constructing series or frames from data, certain values
are automatically treated as "missing values". This includes Double.NaN, null values
for reference types and for nullable types:
Series.ofValues [ Double.NaN; 1.0; 3.14 ]
|
[ Nullable(1); Nullable(); Nullable(3) ]
|> Series.ofValues
|
Missing values are automatically skipped when performing statistical computations such
as Series.mean. They are also ignored by projections and filtering, including
Series.mapValues. When you want to handle missing values, you can use Series.mapAll
that gets the value as option<T>:
// Get column with missing values
let ozone = air?Ozone
// Replace missing values with zeros
ozone |> Series.mapAll (fun k v ->
match v with None -> Some 0.0 | v -> v)
// Fill missing values with constant
ozone |> Series.fillMissingWith 0.0
// Available values are copied in backward
// direction to fill missing values
ozone |> Series.fillMissing Direction.Backward
// Available values are propagated forward
ozone |> Series.fillMissing Direction.Forward
// Fill values and drop those that could not be filled
ozone |> Series.fillMissing Direction.Forward
|> Series.dropMissing
Various other strategies for handling missing values are not currently directly
supported by the library, but can be easily added using Series.fillMissingUsing.
It takes a function and calls it on all missing values:
// Fill missing values using interpolation function
ozone |> Series.fillMissingUsing (fun k ->
// Get previous and next values
let prev = ozone.TryGet(k, Lookup.ExactOrSmaller)
let next = ozone.TryGet(k, Lookup.ExactOrGreater)
// Pattern match to check which values were available
match prev, next with
| OptionalValue.Present(p), OptionalValue.Present(n) ->
(p + n) / 2.0
| OptionalValue.Present(v), _
| _, OptionalValue.Present(v) -> v
| _ -> 0.0)
module Frame from Deedle
<summary> The `Frame` module provides an F#-friendly API for working with data frames. The module follows the usual desing for collection-processing in F#, so the functions work well with the pipelining operator (`|>`). For example, given a frame with two columns representing prices, we can use `Frame.pctChange` to calculate daily returns like this: let df = frame [ "MSFT" => prices1; "AAPL" => prices2 ] let rets = df |> Frame.pctChange 1 rets |> Stats.mean Note that the `Stats.mean` operation is overloaded and works both on series (returning a number) and on frames (returning a series). You can also use `Frame.diff` if you need absolute differences rather than relative changes. The functions in this module are designed to be used from F#. For a C#-friendly API, see the `FrameExtensions` type. For working with individual series, see the `Series` module. The functions in the `Frame` module are grouped in a number of categories and documented below. Accessing frame data and lookup ------------------------------- Functions in this category provide access to the values in the fame. You can also add and remove columns from a frame (which both return a new value). - `addCol`, `replaceCol` and `dropCol` can be used to create a new data frame with a new column, by replacing an existing column with a new one, or by dropping an existing column - `cols` and `rows` return the columns or rows of a frame as a series containing objects; `getCols` and `getRows` return a generic series and cast the values to the type inferred from the context (columns or rows of incompatible types are skipped); `getNumericCols` returns columns of a type convertible to `float` for convenience. - You can get a specific row or column using `get[Col|Row]` or `lookup[Col|Row]` functions. The `lookup` variant lets you specify lookup behavior for key matching (e.g. find the nearest smaller key than the specified value). There are also `[try]get` and `[try]Lookup` functions that return optional values and functions returning entire observations (key together with the series). - `sliceCols` and `sliceRows` return a sub-frame containing only the specified columns or rows. Finally, `toArray2D` returns the frame data as a 2D array. Grouping, windowing and chunking -------------------------------- The basic grouping functions in this category can be used to group the rows of a data frame by a specified projection or column to create a frame with hierarchical index such as <c>Frame<'K1 * 'K2, 'C></c>. The functions always aggregate rows, so if you want to group columns, you need to use `Frame.transpose` first. The function `groupRowsBy` groups rows by the value of a specified column. Use `groupRowsBy[Int|Float|String...]` if you want to specify the type of the column in an easier way than using type inference; `groupRowsUsing` groups rows using the specified _projection function_ and `groupRowsByIndex` projects the grouping key just from the row index. More advanced functions include: `aggregateRowsBy` which groups the rows by a specified sequence of columns and aggregates each group into a single value; `pivotTable` implements the pivoting operation [as documented in the tutorials](../frame.html#pivot). The `melt` and `unmelt` functions turn the data frame into a single data frame containing columns `Row`, `Column` and `Value` containing the data of the original frame; `melt` can be used to turn this representation back into an original frame. A simple windowing functions that are exposed for an entire frame operations are `window` and `windowInto`. For more complex windowing operations, you currently have to use `mapRows` or `mapCols` and apply windowing on individual series. Sorting and index manipulation ------------------------------ A frame is indexed by row keys and column keys. Both of these indices can be sorted (by the keys). A frame that is sorted allows a number of additional operations (such as lookup using the `Lookp.ExactOrSmaller` lookup behavior). The functions in this category provide ways for manipulating the indices. It is expected that most operations are done on rows and so more functions are available in a row-wise way. A frame can alwyas be transposed using `Frame.transpose`. Index operations: The existing row/column keys can be replaced by a sequence of new keys using the `indexColsWith` and `indexRowsWith` functions. Row keys can also be replaced by ordinal numbers using `indexRowsOrdinally`. The function `indexRows` uses the specified column of the original frame as the index. It removes the column from the resulting frame (to avoid this, use overloaded `IndexRows` method). This function infers the type of row keys from the context, so it is usually more convenient to use `indexRows[Date|String|Int|...]` functions. Finally, if you want to calculate the index value based on multiple columns of the row, you can use `indexRowsUsing`. Sorting frame rows: Frame rows can be sorted according to the value of a specified column using the `sortRows` function; `sortRowsBy` takes a projection function which lets you transform the value of a column (e.g. to project a part of the value). The functions `sortRowsByKey` and `sortColsByKey` sort the rows or columns using the default ordering on the key values. The result is a frame with ordered index. Expanding columns: When the frame contains a series with complex .NET objects such as F# records or C# classes, it can be useful to "expand" the column. This operation looks at the type of the objects, gets all properties of the objects (recursively) and generates multiple series representing the properties as columns. The function `expandCols` expands the specified columns while `expandAllCols` applies the expansion to all columns of the data frame. Frame transformations --------------------- Functions in this category perform standard transformations on data frames including projections, filtering, taking some sub-frame of the frame, aggregating values using scanning and so on. Projection and filtering functions such as `[map|filter][Cols|Rows]` call the specified function with the column or row key and an <c>ObjectSeries<'K></c> representing the column or row. You can use functions ending with `Values` (such as `mapRowValues`) when you do not require the row key, but only the row series; `mapRowKeys` and `mapColKeys` can be used to transform the keys. You can use `reduceValues` to apply a custom reduction to values of columns. Other aggregations are available in the `Stats` module. You can also get a row with the greaterst or smallest value of a given column using `[min|max]RowBy`. The functions `take[Last]` and `skip[Last]` can be used to take a sub-frame of the original source frame by skipping a specified number of rows. Note that this does not require an ordered frame and it ignores the index - for index-based lookup use slicing, such as `df.Rows.[lo .. hi]`, instead. Finally the `shift` function can be used to obtain a frame with values shifted by the specified offset. This can be used e.g. to get previous value for each key using `Frame.shift 1 df`. The `diff` function calculates difference from previous value using `df - (Frame.shift offs df)`. Processing frames with exceptions --------------------------------- The functions in this group can be used to write computations over frames that may fail. They use the type <c>tryval<'T></c> which is defined as a discriminated union with two cases: Success containing a value, or Error containing an exception. Using <c>tryval<'T></c> as a value in a data frame is not generally recommended, because the type of values cannot be tracked in the type. For this reason, it is better to use <c>tryval<'T></c> with individual series. However, `tryValues` and `fillErrorsWith` functions can be used to get values, or fill failed values inside an entire data frame. The `tryMapRows` function is more useful. It can be used to write a transformation that applies a computation (which may fail) to each row of a data frame. The resulting series is of type <c>Series<'R, tryval<'T>></c> and can be processed using the <c>Series</c> module functions. Missing values -------------- This group of functions provides a way of working with missing values in a data frame. The category provides the following functions that can be used to fill missing values: * `fillMissingWith` fills missing values with a specified constant * `fillMissingUsing` calls a specified function for every missing value * `fillMissing` and variants propagates values from previous/later keys We use the terms _sparse_ and _dense_ to denote series that contain some missing values or do not contain any missing values, respectively. The functions `denseCols` and `denseRows` return a series that contains only dense columns or rows and all sparse rows or columns are replaced with a missing value. The `dropSparseCols` and `dropSparseRows` functions drop these missing values and return a frame with no missing values. Joining, merging and zipping ---------------------------- The simplest way to join two frames is to use the `join` operation which can be used to perform left, right, outer or inner join of two frames. When the row keys of the frames do not match exactly, you can use `joinAlign` which takes an additional parameter that specifies how to find matching key in left/right join (e.g. by taking the nearest smaller available key). Frames that do not contian overlapping values can be combined using `merge` (when combining just two frames) or using `mergeAll` (for larger number of frames). Tha latter is optimized to work well for a large number of data frames. Finally, frames with overlapping values can be combined using `zip`. It takes a function that is used to combine the overlapping values. A `zipAlign` function provides a variant with more flexible row key matching (as in `joinAlign`) Hierarchical index operations ----------------------------- A data frame has a hierarchical row index if the row index is formed by a tuple, such as <c>Frame<'R1 * 'R2, 'C></c>. Frames of this kind are returned, for example, by the grouping functions such as <c>Frame.groupRowsBy</c>. The functions in this category provide ways for working with data frames that have hierarchical row keys. The functions <c>applyLevel</c> and <c>reduceLevel</c> can be used to reduce values according to one of the levels. The <c>applyLevel</c> function takes a reduction of type <c>Series<'K, 'T> -> 'T</c> while <c>reduceLevel</c> reduces individual values using a function of type <c>'T -> 'T -> 'T</c>. The functions <c>nest</c> and <c>unnest</c> can be used to convert between frames with hierarchical indices (<c>Frame<'K1 * 'K2, 'C></c>) and series of frames that represent individual groups (<c>Series<'K1, Frame<'K2, 'C>></c>). The <c>nestBy</c> function can be used to perform group by operation and return the result as a series of frems. </summary>
<category>Frame and series operations</category>
--------------------
type Frame = static member ReadCsv: location: string * [<Optional>] hasHeaders: Nullable<bool> * [<Optional>] inferTypes: Nullable<bool> * [<Optional>] inferRows: Nullable<int> * [<Optional>] schema: string * [<Optional>] separators: string * [<Optional>] culture: string * [<Optional>] maxRows: Nullable<int> * [<Optional>] missingValues: string array * [<Optional>] preferOptions: bool * [<Optional>] encoding: Encoding -> Frame<int,string> + 1 overload static member ReadReader: reader: IDataReader -> Frame<int,string> static member CustomExpanders: Dictionary<Type,Func<obj,(string * Type * obj) seq>> with get static member NonExpandableInterfaces: ResizeArray<Type> with get static member NonExpandableTypes: HashSet<Type> with get
<summary> Provides static methods for creating frames, reading frame data from CSV files and database (via IDataReader). The type also provides global configuration for reflection-based expansion. </summary>
<category>Frame and series operations</category>
--------------------
type Frame<'TRowKey,'TColumnKey (requires equality and equality)> = interface IDynamicMetaObjectProvider interface INotifyCollectionChanged interface IFrameFormattable interface IFsiFormattable interface IFrame new: rowIndex: IIndex<'TRowKey> * columnIndex: IIndex<'TColumnKey> * data: IVector<IVector> * indexBuilder: IIndexBuilder * vectorBuilder: IVectorBuilder -> Frame<'TRowKey,'TColumnKey> + 1 overload member AddColumn: column: 'TColumnKey * series: 'V seq -> unit + 3 overloads member AggregateRowsBy: groupBy: 'TColumnKey seq * aggBy: 'TColumnKey seq * aggFunc: Func<Series<'TRowKey,'a>,'b> -> Frame<int,'TColumnKey> member Clone: unit -> Frame<'TRowKey,'TColumnKey> member ColumnApply: f: Func<Series<'TRowKey,'T>,ISeries<'TRowKey>> -> Frame<'TRowKey,'TColumnKey> + 1 overload ...
<summary> A frame is the key Deedle data structure (together with series). It represents a data table (think spreadsheet or CSV file) with multiple rows and columns. The frame consists of row index, column index and data. The indices are used for efficient lookup when accessing data by the row key `'TRowKey` or by the column key `'TColumnKey`. Deedle frames are optimized for the scenario when all values in a given column are of the same type (but types of different columns can differ). </summary>
<remarks><para>Joining, zipping and appending:</para><para> More info </para></remarks>
<category>Core frame and series types</category>
--------------------
new: names: 'TColumnKey seq * columns: ISeries<'TRowKey> seq -> Frame<'TRowKey,'TColumnKey>
new: rowIndex: Indices.IIndex<'TRowKey> * columnIndex: Indices.IIndex<'TColumnKey> * data: IVector<IVector> * indexBuilder: Indices.IIndexBuilder * vectorBuilder: Vectors.IVectorBuilder -> Frame<'TRowKey,'TColumnKey>
static member Frame.ReadCsv: stream: Stream * [<Runtime.InteropServices.Optional>] hasHeaders: Nullable<bool> * [<Runtime.InteropServices.Optional>] inferTypes: Nullable<bool> * [<Runtime.InteropServices.Optional>] inferRows: Nullable<int> * [<Runtime.InteropServices.Optional>] schema: string * [<Runtime.InteropServices.Optional>] separators: string * [<Runtime.InteropServices.Optional>] culture: string * [<Runtime.InteropServices.Optional>] maxRows: Nullable<int> * [<Runtime.InteropServices.Optional>] missingValues: string array * [<Runtime.InteropServices.Optional>] preferOptions: Nullable<bool> * [<Runtime.InteropServices.Optional>] encoding: Text.Encoding -> Frame<int,string>
static member Frame.ReadCsv: location: string * [<Runtime.InteropServices.Optional>] hasHeaders: Nullable<bool> * [<Runtime.InteropServices.Optional>] inferTypes: Nullable<bool> * [<Runtime.InteropServices.Optional>] inferRows: Nullable<int> * [<Runtime.InteropServices.Optional>] schema: string * [<Runtime.InteropServices.Optional>] separators: string * [<Runtime.InteropServices.Optional>] culture: string * [<Runtime.InteropServices.Optional>] maxRows: Nullable<int> * [<Runtime.InteropServices.Optional>] missingValues: string array * [<Runtime.InteropServices.Optional>] preferOptions: bool * [<Runtime.InteropServices.Optional>] encoding: Text.Encoding -> Frame<int,string>
static member Frame.ReadCsv: path: string * ?hasHeaders: bool * ?inferTypes: bool * ?inferRows: int * ?schema: string * ?separators: string * ?culture: string * ?maxRows: int * ?missingValues: string array * ?preferOptions: bool * ?typeResolver: (string -> string option) * ?encoding: Text.Encoding -> Frame<int,string>
static member Frame.ReadCsv: stream: Stream * ?hasHeaders: bool * ?inferTypes: bool * ?inferRows: int * ?schema: string * ?separators: string * ?culture: string * ?maxRows: int * ?missingValues: string array * ?preferOptions: bool * ?typeResolver: (string -> string option) * ?encoding: Text.Encoding -> Frame<int,string>
static member Frame.ReadCsv: path: string * indexCol: string * ?hasHeaders: bool * ?inferTypes: bool * ?inferRows: int * ?schema: string * ?separators: string * ?culture: string * ?maxRows: int * ?missingValues: string array * ?preferOptions: bool * ?typeResolver: (string -> string option) * ?encoding: Text.Encoding -> Frame<'R,string> (requires equality)
<summary> Returns a data frame that contains the same data as the input, but whose rows are an ordered series. This allows using operations that are only available on indexed series such as alignment and inexact lookup. <category>Sorting and index manipulation</category> </summary>
[<Struct>] type DateTime = new: date: DateOnly * time: TimeOnly -> unit + 16 overloads member Add: value: TimeSpan -> DateTime member AddDays: value: float -> DateTime member AddHours: value: float -> DateTime member AddMicroseconds: value: float -> DateTime member AddMilliseconds: value: float -> DateTime member AddMinutes: value: float -> DateTime member AddMonths: months: int -> DateTime member AddSeconds: value: float -> DateTime member AddTicks: value: int64 -> DateTime ...
<summary>Represents an instant in time, typically expressed as a date and time of day.</summary>
--------------------
DateTime ()
(+0 other overloads)
DateTime(ticks: int64) : DateTime
(+0 other overloads)
DateTime(date: DateOnly, time: TimeOnly) : DateTime
(+0 other overloads)
DateTime(ticks: int64, kind: DateTimeKind) : DateTime
(+0 other overloads)
DateTime(date: DateOnly, time: TimeOnly, kind: DateTimeKind) : DateTime
(+0 other overloads)
DateTime(year: int, month: int, day: int) : DateTime
(+0 other overloads)
DateTime(year: int, month: int, day: int, calendar: Globalization.Calendar) : DateTime
(+0 other overloads)
DateTime(year: int, month: int, day: int, hour: int, minute: int, second: int) : DateTime
(+0 other overloads)
DateTime(year: int, month: int, day: int, hour: int, minute: int, second: int, kind: DateTimeKind) : DateTime
(+0 other overloads)
DateTime(year: int, month: int, day: int, hour: int, minute: int, second: int, calendar: Globalization.Calendar) : DateTime
(+0 other overloads)
static member FrameExtensions.SaveCsv: frame: Frame<'R,'C> * path: string * keyNames: string seq * [<Runtime.InteropServices.Optional>] separator: char * [<Runtime.InteropServices.Optional>] culture: Globalization.CultureInfo -> unit (requires equality and equality)
static member FrameExtensions.SaveCsv: frame: Frame<'R,'C> * writer: TextWriter * [<Runtime.InteropServices.Optional>] includeRowKeys: bool * [<Runtime.InteropServices.Optional>] keyNames: string seq * [<Runtime.InteropServices.Optional>] separator: char * [<Runtime.InteropServices.Optional>] culture: Globalization.CultureInfo -> unit (requires equality and equality)
static member FrameExtensions.SaveCsv: frame: Frame<'R,'C> * path: string * [<Runtime.InteropServices.Optional>] includeRowKeys: bool * [<Runtime.InteropServices.Optional>] keyNames: string seq * [<Runtime.InteropServices.Optional>] separator: char * [<Runtime.InteropServices.Optional>] culture: Globalization.CultureInfo -> unit (requires equality and equality)
member Frame.SaveCsv: writer: TextWriter * ?includeRowKeys: bool * ?keyNames: string seq * ?separator: char * ?culture: Globalization.CultureInfo -> unit
member Frame.SaveCsv: path: string * ?includeRowKeys: bool * ?keyNames: string seq * ?separator: char * ?culture: Globalization.CultureInfo -> unit
<summary>Performs operations on <see cref="T:System.String" /> instances that contain file or directory path information. These operations are performed in a cross-platform manner.</summary>
val string: value: 'T -> string
--------------------
type string = String
val int: value: 'T -> int (requires member op_Explicit)
--------------------
type int = int32
--------------------
type int<'Measure> = int
static member Frame.ofRecords: values: 'T seq -> Frame<int,string>
static member Frame.ofRecords: values: Collections.IEnumerable * indexCol: string -> Frame<'R,string> (requires equality)
<summary> Returns a data frame whose rows are indexed based on the specified column of the original data frame. This function casts (or converts) the column key to values of type `string` (a generic variant that may require some type annotation is `Frame.indexRows`) The specified column is removed from the resulting frame. </summary>
<param name="frame">Source data frame whose row index is to be replaced.</param>
<param name="column">The name of a column in the original data frame that will be used for the new index. Note that the values in the column need to be unique.</param>
<category>Sorting and index manipulation</category>
member Frame.GetColumn<'R> : column: 'TColumnKey * lookup: Lookup -> Series<'TRowKey,'R>
module Series from Deedle
<summary> The `Series` module provides an F#-friendly API for working with data and time series. The API follows the usual design for collection-processing in F#, so the functions work well with the pipelining (<c>|></c>) operator. For example, given a series with ages, we can use `Series.filterValues` to filter outliers and then `Stats.mean` to calculate the mean: ages |> Series.filterValues (fun v -> v > 0.0 && v < 120.0) |> Stats.mean The module provides comprehensive set of functions for working with series. The same API is also exposed using C#-friendly extension methods. In C#, the above snippet could be written as: [lang=csharp] ages .Where(kvp => kvp.Value > 0.0 && kvp.Value < 120.0) .Mean() For more information about similar frame-manipulation functions, see the `Frame` module. For more information about C#-friendly extensions, see `SeriesExtensions`. The functions in the `Series` module are grouped in a number of categories and documented below. Accessing series data and lookup -------------------------------- Functions in this category provide access to the values in the series. - The term _observation_ is used for a key value pair in the series. - When working with a sorted series, it is possible to perform lookup using keys that are not present in the series - you can specify to search for the previous or next available value using _lookup behavior_. - Functions such as `get` and `getAll` have their counterparts `lookup` and `lookupAll` that let you specify lookup behavior. - For most of the functions that may fail, there is a `try[Foo]` variant that returns `None` instead of failing. - Functions with a name ending with `At` perform lookup based on the absolute integer offset (and ignore the keys of the series) Series transformations ---------------------- Functions in this category perform standard transformations on series including projections, filtering, taking some sub-series of the series, aggregating values using scanning and so on. Projection and filtering functions generally skip over missing values, but there are variants `filterAll` and `mapAll` that let you handle missing values explicitly. Keys can be transformed using `mapKeys`. When you do not need to consider the keys, and only care about values, use `filterValues` and `mapValues` (which is also aliased as the `$` operator). Series supports standard set of folding functions including `reduce` and `fold` (to reduce series values into a single value) as well as the `scan[All]` function, which can be used to fold values of a series into a series of intermeidate folding results. The functions `take[Last]` and `skip[Last]` can be used to take a sub-series of the original source series by skipping a specified number of elements. Note that this does not require an ordered series and it ignores the index - for index-based lookup use slicing, such as `series.[lo .. hi]`, instead. Finally the `shift` function can be used to obtain a series with values shifted by the specified offset. This can be used e.g. to get previous value for each key using `Series.shift 1 ts`. The `diff` function calculates difference from previous value using `ts - (Series.shift offs ts)`. Processing series with exceptions --------------------------------- The functions in this group can be used to write computations over series that may fail. They use the type <c>tryval<'T></c> which is defined as a discriminated union with two cases: Success containing a value, or Error containing an exception. The function `tryMap` lets you create <c>Series<'K, tryval<'T>></c> by mapping over values of an original series. You can then extract values using `tryValues`, which throws `AggregateException` if there were any errors. Functions `tryErrors` and `trySuccesses` give series containing only errors and successes. You can fill failed values with a constant using `fillErrorsWith`. Hierarchical index operations ----------------------------- When the key of a series is tuple, the elements of the tuple can be treated as multiple levels of a index. For example <c>Series<'K1 * 'K2, 'V></c> has two levels with keys of types <c>'K1</c> and <c>'K2</c> respectively. The functions in this cateogry provide a way for aggregating values in the series at one of the levels. For example, given a series `input` indexed by two-element tuple, you can calculate mean for different first-level values as follows: input |> applyLevel fst Stats.mean Note that the `Stats` module provides helpers for typical statistical operations, so the above could be written just as `input |> Stats.levelMean fst`. Grouping, windowing and chunking -------------------------------- This category includes functions that group data from a series in some way. Two key concepts here are _window_ and _chunk_. Window refers to (overlapping) sliding windows over the input series while chunk refers to non-overlapping blocks of the series. The boundary behavior can be specified using the `Boundary` flags. The value `Skip` means that boundaries (incomplete windows or chunks) should be skipped. The value `AtBeginning` and `AtEnding` can be used to define at which side should the boundary be returned (or skipped). For chunking, `AtBeginning ||| Skip` makes sense and it means that the incomplete chunk at the beginning should be skipped (aligning the last chunk with the end). The behavior may be specified in a number of ways (which is reflected in the name): - `dist` - using an absolute distance between the keys - `while` - using a condition on the first and last key - `size` - by specifying the absolute size of the window/chunk The functions ending with `Into` take a function to be applied to the window/chunk. The functions `window`, `windowInto` and `chunk`, `chunkInto` are simplified versions that take a size. There is also `pairwise` function for sliding window of size two. Missing values -------------- This group of functions provides a way of working with missing values in a series. The `dropMissing` function drops all keys for which there are no values in the series. The `withMissingFrom` function lets you copy missing values from another series. The remaining functions provide different mechanism for filling the missing values. * `fillMissingWith` fills missing values with a specified constant * `fillMissingUsing` calls a specified function for every missing value * `fillMissing` and variants propagates values from previous/later keys Sorting and index manipulation ------------------------------ A series that is sorted by keys allows a number of additional operations (such as lookup using the `Lookp.ExactOrSmaller` lookup behavior). However, it is also possible to sort series based on the values - although the functions for manipulation with series do not guarantee that the order will be preserved. To sort series by keys, use `sortByKey`. Other sorting functions let you sort the series using a specified comparer function (`sortWith`), using a projection function (`sortBy`) and using the default comparison (`sort`). In addition, you can also replace the keys of a series with other keys using `indexWith` or with integers using `indexOrdinally`. To pick and reorder series values using to match a list of keys use `realign`. Sampling, resampling and advanced lookup ---------------------------------------- Given a (typically) time series sampling or resampling makes it possible to get time series with representative values at lower or uniform frequency. We use the following terminology: - `lookup` and `sample` functions find values at specified key; if a key is not available, they can look for value associated with the nearest smaller or the nearest greater key. - `resample` function aggregate values values into chunks based on a specified collection of keys (e.g. explicitly provided times), or based on some relation between keys (e.g. date times having the same date). - `resampleUniform` is similar to resampling, but we specify keys by providing functions that generate a uniform sequence of keys (e.g. days), the operation also fills value for days that have no corresponding observations in the input sequence. Joining, merging and zipping ---------------------------- Given two series, there are two ways to combine the values. If the keys in the series are not overlapping (or you want to throw away values from one or the other series), then you can use `merge` or `mergeUsing`. To merge more than 2 series efficiently, use the `mergeAll` function, which has been optimized for large number of series. If you want to align two series, you can use the _zipping_ operation. This aligns two series based on their keys and gives you tuples of values. The default behavior (`zip`) uses outer join and exact matching. For ordered series, you can specify other forms of key lookups (e.g. find the greatest smaller key) using `zipAlign`. functions ending with `Into` are generally easier to use as they call a specified function to turn the tuple (of possibly missing values) into a new value. For more complicated behaviors, it is often convenient to use joins on frames instead of working with series. Create two frames with single columns and then use the join operation. The result will be a frame with two columns (which is easier to use than series of tuples). </summary>
<category>Frame and series operations</category>
--------------------
type Series = static member ofNullables: values: Nullable<'a> seq -> Series<int,'a> (requires default constructor and value type and 'a :> ValueType) static member ofObservations: observations: ('a * 'b) seq -> Series<'a,'b> (requires equality) static member ofOptionalObservations: observations: ('K * 'a option) seq -> Series<'K,'a> (requires equality) static member ofValues: values: 'a seq -> Series<int,'a>
--------------------
type Series<'K,'V (requires equality)> = interface ISeriesFormattable interface IFsiFormattable interface ISeries<'K> new: index: IIndex<'K> * vector: IVector<'V> * vectorBuilder: IVectorBuilder * indexBuilder: IIndexBuilder -> Series<'K,'V> + 3 overloads member After: lowerExclusive: 'K -> Series<'K,'V> member Aggregate: aggregation: Aggregation<'K> * keySelector: Func<DataSegment<Series<'K,'V>>,'TNewKey> * valueSelector: Func<DataSegment<Series<'K,'V>>,OptionalValue<'R>> -> Series<'TNewKey,'R> (requires equality) + 1 overload member AsyncMaterialize: unit -> Async<Series<'K,'V>> member Before: upperExclusive: 'K -> Series<'K,'V> member Between: lowerInclusive: 'K * upperInclusive: 'K -> Series<'K,'V> member Compare: another: Series<'K,'V> -> Series<'K,Diff<'V>> ...
<summary> The type <c>Series<K, V></c> represents a data series consisting of values `V` indexed by keys `K`. The keys of a series may or may not be ordered </summary>
<category>Core frame and series types</category>
--------------------
new: pairs: Collections.Generic.KeyValuePair<'K,'V> seq -> Series<'K,'V>
new: keys: 'K seq * values: 'V seq -> Series<'K,'V>
new: keys: 'K array * values: 'V array -> Series<'K,'V>
new: index: Indices.IIndex<'K> * vector: IVector<'V> * vectorBuilder: Vectors.IVectorBuilder * indexBuilder: Indices.IIndexBuilder -> Series<'K,'V>
<summary> A function for constructing data frame from a sequence of name - column pairs. This provides a nicer syntactic sugar for `Frame.ofColumns`. </summary>
<example> To create a simple frame with two columns, you can write: <code> frame [ "A" => series [ 1 => 30.0; 2 => 35.0 ] "B" => series [ 1 => 30.0; 3 => 40.0 ] ] </code></example>
<category>Frame construction</category>
<summary> Creates a new data frame where the specified columns are expanded based on runtime structure of the objects they store. A column can be expanded if it is <c>Series<string, T></c> or <c>IDictionary<K, V></c> or if it is any .NET object with readable properties. </summary>
<param name="names">Names of columns in the original data frame to be expanded</param>
<param name="frame">Input data frame whose columns will be expanded</param>
<remarks><example> Given a data frame with a series that contains tuples, you can expand the tuple members and get a frame with columns `S.Item1` and `S.Item2`: <code> let df = frame [ "S" => series [ 1 => (1, "One"); 2 => (2, "Two") ] ] df |> Frame.expandCols ["S"] </code></example></remarks>
<category>Sorting and index manipulation</category>
<category>Accessors and slicing</category>
<summary> Create a series from a sequence of key-value pairs that represent the observations of the series. This function can be used together with the `=>` operator to create key-value pairs. </summary>
<example> // Creates a series with squares of numbers let sqs = series [ 1 => 1.0; 2 => 4.0; 3 => 9.0 ] </example>
<summary> Returns a new series whose values are the results of applying the given function to values of the original series. This function skips over missing values and call the function with just values. It is also aliased using the `$` operator so you can write `series $ func` for `series |> Series.mapValues func`. </summary>
<category>Series transformations</category>
member Frame.ReplaceColumn: column: 'TColumnKey * series: ISeries<'TRowKey> -> unit
member Frame.ReplaceColumn: column: 'TColumnKey * data: 'V seq * lookup: Lookup -> unit
member Frame.ReplaceColumn: column: 'TColumnKey * series: ISeries<'TRowKey> * lookup: Lookup -> unit
member Frame.Merge: otherFrames: Frame<'TRowKey,'TColumnKey> seq -> Frame<'TRowKey,'TColumnKey>
member Frame.Merge: otherFrame: Frame<'TRowKey,'TColumnKey> -> Frame<'TRowKey,'TColumnKey>
static member FrameExtensions.Merge: frame: Frame<'TRowKey,'TColumnKey> * rowKey: 'TRowKey * row: ISeries<'TColumnKey> -> Frame<'TRowKey,'TColumnKey> (requires equality and equality)
<summary> Get the value for the specified key. Returns `None` when the key does not exist or the value is missing. Uses exact lookup semantics for key lookup - use `tryLookup` for more options </summary>
<category>Accessing series data and lookup</category>
<summary> Create a new series that contains values for all provided keys. Uses exact lookup semantics for key lookup - use `lookupAll` for more options </summary>
<param name="keys">A sequence of keys that will form the keys of the retunred sequence</param>
<param name="series">The input series</param>
<category>Accessing series data and lookup</category>
<summary> Return observations with available values. The operation skips over all keys with missing values (such as values created from `null`, `Double.NaN`, or those that are missing due to outer join etc.). </summary>
<category>Accessing series data and lookup</category>
<summary> Returns all keys from the sequence, together with the associated (optional) values. </summary>
<category>Accessing series data and lookup</category>
<summary> Returns a new series whose keys are the results of applying the given function to keys of the original series. </summary>
<category>Series transformations</category>
<summary> Groups a series (ordered or unordered) using the specified key selector (`keySelector`) and then returns a series of (nested) series as the result. The outer series is indexed by the newly produced keys, the nested series are indexed with the original keys. </summary>
<param name="keySelector">Generates a new key that is used for aggregation, based on the original key and value. The new key must support equality testing.</param>
<param name="series">An input series to be grouped.</param>
<category>Grouping, windowing and chunking</category>
module List from Microsoft.FSharp.Collections
--------------------
type List<'T> = | op_Nil | op_ColonColon of Head: 'T * Tail: 'T list interface IReadOnlyList<'T> interface IReadOnlyCollection<'T> interface IEnumerable interface IEnumerable<'T> member GetReverseIndex: rank: int * offset: int -> int member GetSlice: startIndex: int option * endIndex: int option -> 'T list static member Cons: head: 'T * tail: 'T list -> 'T list member Head: 'T with get member IsEmpty: bool with get member Item: index: int -> 'T with get ...
<summary> Groups a series (ordered or unordered) using the specified key selector (`keySelector`) and then aggregates each group into a single value, returned in the resulting series, using the provided `f` function. </summary>
<param name="keySelector">Generates a new key that is used for aggregation, based on the original key and value. The new key must support equality testing.</param>
<param name="f">A function to aggregate each group of collected elements.</param>
<param name="series">An input series to be grouped.</param>
<category>Grouping, windowing and chunking</category>
<summary> Returns the total number of keys in the specified series. This returns the total length of the series, including keys for which there is no value available. </summary>
<category>Accessing series data and lookup</category>
static member Frame.ofRows: rows: Series<'R,#ISeries<'C>> -> Frame<'R,'C> (requires equality and equality)
<summary> Fill missing values of a given type in the frame with a constant value. The fill value is applied to columns whose element type matches the fill value type (using safe numeric widening). This includes both cases where the column type can be widened to the fill value type (e.g. decimal columns filled with a float value) and cases where the fill value can be widened to the column type (e.g. an integer fill value filling float columns). Columns whose element type is incompatible with the fill value type are left unchanged. </summary>
<param name="frame">An input data frame that is to be filled</param>
<param name="value">A constant value that is used to fill all missing values</param>
<category>Missing values</category>
<summary> Groups the rows of a frame by a specified column in the same way as `groupRowsBy`. This function assumes that the values of the specified column are of type `string`. <category>Grouping, windowing and chunking</category> </summary>
<summary> Group rows of a data frame using the specified `selector`. The selector is called with a row key and object series representing the row and should return a new key. The result is a frame with multi-level index, where the first level is formed by the newly created keys. </summary>
<category>Grouping, windowing and chunking</category>
member ObjectSeries.GetAs: column: 'K * fallback: 'R -> 'R
<summary> Given a frame with two-level row index, returns a series indexed by the first part of the key, containing frames representing individual groups. This function can be used if you want to perform a transformation individually on each group (e.g. using `Series.mapValues` after calling `Frame.nest`). </summary>
<category>Hierarchical index operations</category>
<summary> Given a series of frames, returns a new data frame with two-level hierarchical row index, using the series keys as the first component. This function is the dual of `Frame.nest`. <category>Hierarchical index operations</category> </summary>
<summary> Groups the rows of a frame by a specified column in the same way as `groupRowsBy`. This function assumes that the values of the specified column are of type `int`. <category>Grouping, windowing and chunking</category> </summary>
<summary> Builds a new data frame whose row keys are the results of applying the specified function on the row keys of the original data frame. </summary>
<param name="frame">Input data frame to be transformed</param>
<param name="f">Function of one argument that defines the row key mapping</param>
<category>Frame transformations</category>
<summary> Module with helper functions for extracting values from hierarchical tuples </summary>
<category>Primitive types and values</category>
<summary> Flatten a two-level nested tuple into a flat tuple of 3 elements </summary>
<summary> Returns the first and the second value of a three-level hierarchical tuple </summary>
<summary> Returns a series of columns of the data frame indexed by the column keys, which contains those series whose values are convertible to float, and with missing values where the conversion fails. </summary>
<category>Accessing frame data and lookup</category>
<summary> Drop missing values from the specified series. The returned series contains only those keys for which there is a value available in the original one. </summary>
<param name="series">An input series to be filtered</param>
<example><code> let s = series [ 1 => 1.0; 2 => Double.NaN ] s |> Series.dropMissing // val it : Series<int,float> = series [ 1 => 1] </code></example>
<category>Missing values</category>
static member Frame.ofColumns: cols: ('C * #ISeries<'R>) seq -> Frame<'R,'C> (requires equality and equality)
<summary> Groups the elements of the input series in groups based on the keys produced by `level` and then aggregates series representing each group using the specified function `op`. The result is a new series containing the aggregates of each group. This operation is designed to be used with [hierarchical indexing](../frame.html#indexing). </summary>
<param name="series">An input series to be aggregated</param>
<param name="op">A function that takes a series and produces an aggregated result</param>
<param name="level">A delegate that returns a new group key, based on the key in the input series</param>
<category>Hierarchical index operations</category>
<summary> Returns the (non-missing) values of the series as a sequence </summary>
<category>Accessing series data and lookup</category>
<summary> Creates a new data frame resulting from a 'pivot' operation. Consider a denormalized data frame representing a table: column labels are field names & table values are observations of those fields. pivotTable buckets the rows along two axes, according to the results of the functions `rowGrp` and `colGrp`; and then computes a value for the frame of rows that land in each bucket. </summary>
<param name="rowGrp">A function from rowkey & row to group value for the resulting row index</param>
<param name="colGrp">A function from rowkey & row to group value for the resulting col index</param>
<param name="op">A function computing a value from the corresponding bucket frame</param>
<param name="frame">The input data frame to pivot</param>
<category>Grouping, windowing and chunking</category>
<summary> Returns the total number of row keys in the specified frame. This returns the total length of the row series, including keys for which there is no value available. </summary>
<category>Accessing frame data and lookup</category>
static member Stats.mean: series: Series<'K,'V> -> float (requires equality)
<summary>Gets the year component of the date represented by this instance.</summary>
<returns>The year, between 1 and 9999.</returns>
<summary>Represents a double-precision floating-point number.</summary>
type Nullable = static member Compare<'T (requires default constructor and value type and 'T :> ValueType)> : n1: Nullable<'T> * n2: Nullable<'T> -> int static member Equals<'T (requires default constructor and value type and 'T :> ValueType)> : n1: Nullable<'T> * n2: Nullable<'T> -> bool static member GetUnderlyingType: nullableType: Type -> Type static member GetValueRefOrDefaultRef<'T (requires default constructor and value type and 'T :> ValueType)> : nullable: inref<Nullable<'T>> -> inref<'T>
<summary>Supports a value type that can be assigned <see langword="null" />. This class cannot be inherited.</summary>
--------------------
[<Struct>] type Nullable<'T (requires default constructor and value type and 'T :> ValueType)> = new: value: 'T -> unit member Equals: other: obj -> bool member GetHashCode: unit -> int member GetValueOrDefault: unit -> 'T + 1 overload member ToString: unit -> string static member op_Explicit: value: Nullable<'T> -> 'T static member op_Implicit: value: 'T -> Nullable<'T> member HasValue: bool member Value: 'T
<summary>Represents a value type that can be assigned <see langword="null" />.</summary>
<typeparam name="T">The underlying value type of the <see cref="T:System.Nullable`1" /> generic type.</typeparam>
--------------------
Nullable ()
Nullable(value: 'T) : Nullable<'T>
<summary> Returns a new series whose values are the results of applying the given function to values of the original series. This specified function is called even when the value is missing. It returns <c>option<'T></c> so that it can create/eliminate missing values in the result. </summary>
<category>Series transformations</category>
<summary> Fill missing values in the series with a constant value. </summary>
<param name="series">An input series that is to be filled</param>
<param name="value">A constant value that is used to fill all missing values</param>
<category>Missing values</category>
<summary> Fill missing values in the series with the nearest available value (using the specified direction). Note that the series may still contain missing values after call to this function. This operation can only be used on ordered series. </summary>
<param name="series">An input series that is to be filled</param>
<param name="direction">Specifies the direction used when searching for the nearest available value. `Backward` means that we want to look for the first value with a smaller key while `Forward` searches for the nearest greater key.</param>
<example><code> let sample = Series.ofValues [ Double.NaN; 1.0; Double.NaN; 3.0 ] // Returns a series consisting of [1; 1; 3; 3] sample |> Series.fillMissing Direction.Backward // Returns a series consisting of [<missing>; 1; 1; 3] sample |> Series.fillMissing Direction.Forward </code></example>
<category>Missing values</category>
<summary> Specifies in which direction should we look when performing operations such as `Series.Pairwise`. </summary>
<example><code> let abc = [ 1 => "a"; 2 => "b"; 3 => "c" ] |> Series.ofObservations // Using 'Forward' the key of the first element is used abc.Pairwise(direction=Direction.Forward) // [ 1 => ("a", "b"); 2 => ("b", "c") ] // Using 'Backward' the key of the second element is used abc.Pairwise(direction=Direction.Backward) // [ 2 => ("a", "b"); 3 => ("b", "c") ] </code></example>
<category>Parameters and results of various operations</category>
<summary> Fill missing values in the series using the specified function. The specified function is called with all keys for which the series does not contain value and the result of the call is used in place of the missing value. </summary>
<param name="series">An input series that is to be filled</param>
<param name="f">A function that takes key `K` and generates a value to be used in a place where the original series contains a missing value.</param>
<remarks> This function can be used to implement more complex interpolation. For example see [handling missing values in the tutorial](../frame.html#missing) </remarks>
<category>Missing values</category>
member Series.TryGet: key: 'K * lookup: Lookup -> OptionalValue<'V>
<summary> Represents different behaviors of key lookup in series. For unordered series, the only available option is `Lookup.Exact` which finds the exact key - methods fail or return missing value if the key is not available in the index. For ordered series `Lookup.Greater` finds the first greater key (e.g. later date) with a value. `Lookup.Smaller` searches for the first smaller key. The options `Lookup.ExactOrGreater` and `Lookup.ExactOrSmaller` finds the exact key (if it is present) and otherwise search for the nearest larger or smaller key, respectively. </summary>
<category>Parameters and results of various operations</category>
<summary> Lookup a value associated with the specified key or with the nearest smaller key that has a value available. Fails (or returns missing value) only when the specified key is smaller than all available keys. </summary>
<summary> Lookup a value associated with the specified key or with the nearest greater key that has a value available. Fails (or returns missing value) only when the specified key is greater than all available keys. </summary>
module OptionalValue from Deedle
<summary> Provides various helper functions for using the <c>OptionalValue<T></c> type from F# (The functions are similar to those in the standard <c>Option</c> module). </summary>
<category>Primitive types and values</category>
--------------------
type OptionalValue = class end
<summary> Non-generic type that makes it easier to create <c>OptionalValue<T></c> values from C# by benefiting the type inference for generic method invocations. </summary>
<category>Primitive types and values</category>
--------------------
[<Struct>] type OptionalValue<'T> = new: value: 'T -> OptionalValue<'T> override Equals: y: obj -> bool override GetHashCode: unit -> int override ToString: unit -> string member HasValue: bool with get member Value: 'T with get member ValueOrDefault: 'T with get static member Missing: OptionalValue<'T> with get
<summary> Value type that represents a potentially missing value. This is similar to <c>System.Nullable<T></c>, but does not restrict the contained value to be a value type, so it can be used for storing values of any types. When obtained from <c>DataFrame<R, C></c> or <c>Series<K, T></c>, the <c>Value</c> will never be <c>Double.NaN</c> or <c>null</c> (but this is not, in general, checked when constructing the value). The type is only used in C#-friendly API. F# operations generally use expose standard F# <c>option<T></c> type instead. However, there the <c>OptionalValue</c> module contains helper functions for using this type from F# as well as <c>Missing</c> and <c>Present</c> active patterns. </summary>
<category>Primitive types and values</category>
--------------------
OptionalValue ()
new: value: 'T -> OptionalValue<'T>
<summary> Complete active pattern that can be used to pattern match on `OptionalValue<T>`. For example: let optVal = OptionalValue(42) match optVal with | OptionalValue.Missing -> printfn "Empty" | OptionalValue.Present(v) -> printfn "Contains %d" v </summary>
Deedle