Skip to content
This repository was archived by the owner on Jan 4, 2026. It is now read-only.
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 1 addition & 2 deletions shell/conf/cluster_hdil16-D14v2-spark21-293.conf
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,7 @@ source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/hdinsight_defaults.con

#Cluster config
defaultProvider="hdinsight"
azureLocation="Central US"
azureLocationShort="centralus"
azureLocation="East US 2"

clusterID="$(get_id_cluster "$(basename $BASH_SOURCE)")"
clusterName="hdil16-D14v2-spark21-${clusterID}" #no ADLS
Expand Down
29 changes: 29 additions & 0 deletions shell/conf/cluster_hdil2-D14v2-spark21-307.conf
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
#load cluster defaults
source "$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/hdinsight_defaults.conf"

#Cluster config
defaultProvider="hdinsight"
azureLocation="East US 2"

clusterID="$(get_id_cluster "$(basename $BASH_SOURCE)")"
clusterName="hdil2-D14v2-spark21-${clusterID}" #no ADLS
#storageAccountName
numberOfNodes="2" #starts at 0 (max 99) 0 is assigned to master
#Node config
headnodeSize="Standard_D14_v2"
vmSize='Standard_D14_v2'
vmType='linux'

attachedVolumes="0"
diskSize="1024"

#queueJobs="true" #enable on cluster config to queue benchmarks after deploy

#details
vmCores="16"
vmRAM="112"

#costs
clusterCostHour="83.71"
clusterType="PaaS"
clusterDescription=""
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ clusterID="$(get_id_cluster "$(basename $BASH_SOURCE)")"
clusterName="hdil16-D14v2-spark21-${clusterID}" #no ADLS
#storageAccountName
storageAccount="hdil295"
numberOfNodes="32" #starts at 0 (max 99) 0 is assigned to master
numberOfNodes="64" #starts at 0 (max 99) 0 is assigned to master
#Node config
headnodeSize="Standard_D14_v2"
vmSize='Standard_D14_v2'
Expand Down
2 changes: 1 addition & 1 deletion shell/conf/hdinsight_defaults.conf
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ CLUSTER_DISKS="RR1"
#Cost per remote/hour $0.024/GB/month USD 24$ /month Locally Redundant
clusterCostDisk="0.17" # 5TB/hr

resourceGroup="Default-Storage-SouthCentralUS"
resourceGroup="alojadlapremium"
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe it's right (I have no idea), but this looks like a typo since I would expect aloja adla.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A resource group is a set of resources in Azure so we can easily manage them. The only one that was available was the one you see. It makes no difference at all and arranges all our clusters, both HDI and adla together for easier management

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great, but all I'm saying is that alojadlapremium looks wrong, because aloja ends in an a and adla starts in an a, but there is only one a between them. Makes sense?

Since we are still reviewing config changes (did I mention that these have no place in a public repo?), I thought that I could at least try to help by pointing out a possible typo in the name of the resource group. If it's definitely right, then nothing to worry about.

Copy link
Copy Markdown
Contributor Author

@alejandromontero alejandromontero Nov 9, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In our case "DLA" makes complete sense:

dla = data lake analytics
adla = azure data lake analytics

The command line of azure uses "az dla" y "az dls" not adla. Not only that but also one of our accounts is alojadw instead of alojaadw (Data Warehouse).

In essence, I wouldn't say it's a typo.

# We use spark as the type as it already incorporates hadoop and hive
hdiType="Spark" #Hadoop
# TODO Remove: should not be used
Expand Down