- In the StgAggregatedSales.dtsx SSIS package, drag and drop an Azure Pig Task onto the control flow. Rename it apt_AggregateData.
- Double-click on it to open the Azure HDInsight Pig Task Editor and set the properties as shown in the following screenshot:
- In the script property, insert the following code:
SalesExtractsSource = LOAD 'wasbs:///Import/FactOrdersAggregated.txt'; rmf wasbs:///Export/; STORE SalesExtractsSource INTO 'wasbs:///Export/' USING PigStorage('|');
- The first line holds a reference to the Import/FactOrdersAggregated.txt file. The second line removes (deleting) the directory /Export. Finally, the data ...