MPSCNNFullyConnected(3) MetalPerformanceShaders.framework MPSCNNFullyConnected(3)

MPSCNNFullyConnected

#import <MPSCNNConvolution.h>

Inherits MPSCNNConvolution.


(nonnull instancetype) - initWithDevice:weights:
(nonnull instancetype) - initWithDevice:convolutionDescriptor:kernelWeights:biasTerms:flags:
(nullable instancetype) - initWithCoder:device:
(nonnull instancetype) - initWithDevice:

This depends on Metal.framework The MPSCNNFullyConnected specifies a fully connected convolution layer a.k.a. Inner product layer. A fully connected CNN layer is one where every input channel is connected to every output channel. The kernel width is equal to width of source image and the kernel height is equal to the height of source image. Width and height of the output is 1x1. Thus, it takes a srcW x srcH x Ni MPSCNNImage, convolves it with Weights[No][SrcW][srcH][Ni] and produces a 1 x 1 x No output. The following must be true:

kernelWidth  == source.width
kernelHeight == source.height
clipRect.size.width == 1
clipRect.size.height == 1


One can think of a fully connected layer as a matrix multiplication that flattens an image into a vector of length srcW*srcH*Ni. The weights are arragned in a matrix of dimension No x (srcW*srcH*Ni) for product output vectors of length No. The strideInPixelsX, strideInPixelsY, and group must be 1. Offset is not applicable and is ignored. Since clipRect is clamped to the destination image bounds, if the destination is 1x1, one doesn't need to set the clipRect.

Note that one can implement an inner product using MPSCNNConvolution by setting

offset = (kernelWidth/2,kernelHeight/2)
clipRect.origin = (ox,oy), clipRect.size = (1,1)
strideX = strideY = group = 1


However, using the MPSCNNFullyConnected for this is better for performance as it lets us choose the most performant method which may not be possible when using a general convolution. For example, we may internally use matrix multiplication or special reduction kernels for a specific platform.

- (nullable instancetype) initWithCoder: (NSCoder *__nonnull) aDecoder(nonnull id< MTLDevice >) device

NSSecureCoding compatability While the standard NSSecureCoding/NSCoding method -initWithCoder: should work, since the file can't know which device your data is allocated on, we have to guess and may guess incorrectly. To avoid that problem, use initWithCoder:device instead.

Parameters:

aDecoder The NSCoder subclass with your serialized MPSKernel
device The MTLDevice on which to make the MPSKernel

Returns:

A new MPSKernel object, or nil if failure.

Reimplemented from MPSCNNConvolution.

- (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device

Standard init with default properties per filter type

Parameters:

device The device that the filter will be used on. May not be NULL.

Returns:

A pointer to the newly initialized object. This will fail, returning nil if the device is not supported. Devices must be MTLFeatureSet_iOS_GPUFamily2_v1 or later.

Reimplemented from MPSCNNConvolution.

- (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device(const MPSCNNConvolutionDescriptor *__nonnull) convolutionDescriptor(const float *__nonnull) kernelWeights(const float *__nullable) biasTerms(MPSCNNConvolutionFlags) flags

Initializes a convolution kernel WARNING: This API is depreated and will be removed in the future. It cannot be used when training. Also serialization/unserialization wont work for MPSCNNConvolution objects created with this init. Please move onto using initWithDevice:weights:.

Parameters:

device The MTLDevice on which this MPSCNNConvolution filter will be used
convolutionDescriptor A pointer to a MPSCNNConvolutionDescriptor.
kernelWeights A pointer to a weights array. Each entry is a float value. The number of entries is = inputFeatureChannels * outputFeatureChannels * kernelHeight * kernelWidth The layout of filter weight is so that it can be reinterpreted as 4D tensor (array) weight[ outputChannels ][ kernelHeight ][ kernelWidth ][ inputChannels / groups ] Weights are converted to half float (fp16) internally for best performance.
biasTerms A pointer to bias terms to be applied to the convolution output. Each entry is a float value. The number of entries is = numberOfOutputFeatureMaps
flags Currently unused. Pass MPSCNNConvolutionFlagsNone

Returns:

A valid MPSCNNConvolution object or nil, if failure.

Reimplemented from MPSCNNConvolution.

- (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device(nonnull id< MPSCNNConvolutionDataSource >) weights

Initializes a fully connected kernel

Parameters:

device The MTLDevice on which this MPSCNNFullyConnected filter will be used
weights A pointer to a object that conforms to the MPSCNNConvolutionDataSource protocol. The MPSCNNConvolutionDataSource protocol declares the methods that an instance of MPSCNNFullyConnected uses to obtain the weights and bias terms for the CNN fully connected filter.

Returns:

A valid MPSCNNFullyConnected object or nil, if failure.

Reimplemented from MPSCNNConvolution.

Generated automatically by Doxygen for MetalPerformanceShaders.framework from the source code.

Mon Jul 9 2018 Version MetalPerformanceShaders-119.3