Dyalog APL Problem Solving Competition 2020 — Phase II

Annotated Solutions

August 2, 2020 – Dimitri Lozeve

Introduction
Problem 1 – Take a Dive
Problem 2 – Another Step in the Proper Direction
Problem 3 – Past Tasks Blast
Problem 4 – Bioinformatics
Problem 5 – Future and Present Value
Problem 6 – Merge
Problem 7 – UPC
Problem 8 – Balancing the Scales
Problem 9 – Upwardly Mobile

Introduction

After Phase I, here are my solutions to Phase II problems. The full code is included in the post, but everything is also available on GitHub.

A PDF of the problems descriptions is available on the competition website, or directly from my GitHub repo.

The submission guidelines gave a template where everything is defined in a Contest2020.Problems Namespace. I kept the default values for ⎕IO and ⎕ML because the problems were not particularly easier with ⎕IO←0.

:Namespace Contest2020

    :Namespace Problems
        (⎕IO ⎕ML ⎕WX)←1 1 3

Problem 1 – Take a Dive

∇ score←dd DiveScore scores
  :If 7=≢scores
      scores←scores[¯2↓2↓⍋scores]
  :ElseIf 5=≢scores
      scores←scores[¯1↓1↓⍋scores]
  :Else
      scores←scores
  :EndIf
  score←2(⍎⍕)dd×+/scores
∇

This is a very straightforward implementation of the algorithm describe in the problem description. I decided to switch explicitly on the size of the input vector because I feel it is more natural. For the cases with 5 or 7 judges, we use Drop (↓) to remove the lowest and highest scores.

At the end, we sum up the scores with +/ and multiply them by dd. The last operation, 2(⍎⍕), is a train using Format (Dyadic) to round to 2 decimal places, and Execute to get actual numbers and not strings.

Problem 2 – Another Step in the Proper Direction

∇ steps←{p}Steps fromTo;segments;width
  width←|-/fromTo
  :If 0=⎕NC'p' ⍝ No left argument: same as Problem 5 of Phase I
      segments←0,⍳width
  :ElseIf p<0 ⍝ -⌊p is the number of equally-sized steps to take
      segments←(-⌊p){0,⍵×⍺÷⍨⍳⍺}width
  :ElseIf p>0 ⍝ p is the step size
      segments←p{⍵⌊⍺×0,⍳⌈⍵÷⍺}width
  :ElseIf p=0 ⍝ As if we took zero step
      segments←0
  :EndIf
  ⍝ Take into account the start point and the direction.
  steps←fromTo{(⊃⍺)+(-×-/⍺)×⍵}segments
∇

This is an extension to Problem 5 of Phase I. In each case, we compute the “segments”, i.e., the steps starting from 0. In a last step, common to all cases, we add the correct starting point and correct the direction if need be.

To compute equally-sized steps, we first divide the segment \([0, 1]\) in p equal segments with (⍳p)÷p. This subdivision can then be multiplied by the width to obtain the required segments.

When p is the step size, we just divide the width by the step size (rounded to the next largest integer) to get the required number of segments. If the last segment is too large, we “crop” it to the width with Minimum (⌊).

Problem 3 – Past Tasks Blast

∇ urls←PastTasks url;r;paths
  r←HttpCommand.Get url
  paths←('[a-zA-Z0-9_/]+\.pdf'⎕S'&')r.Data
  urls←('https://www.dyalog.com/'∘,)¨paths
∇

I decided to use HttpCommand for this task, since it is simply one ]load HttpCommand away and should be platform-independent.

Parsing XML is not something I consider “fun” in the best of cases, and I feel like APL is not the best language to do this kind of thing. Given how simple the task is, I just decided to find the relevant bits with a regular expression using Replace and Search (⎕S).

After finding all the strings vaguely resembling a PDF file name (only alphanumeric characters and underscores, with a .pdf extension), I just concatenate them to the base URL of the Dyalog domain.

Problem 4 – Bioinformatics

The first task can be solved by decomposing it into several functions.

⍝ Test if a DNA string is a reverse palindrome.
isrevp←{⍵≡⌽'TAGC'['ATCG'⍳⍵]}

First, we compute the complement of a DNA string (using simple indexing) and test if its Reverse (⌽) is equal to the original string.

⍝ Generate all subarrays (position, length) pairs, for 4 ≤ length ≤ 12.
subarrays←{⊃,/(⍳⍵),¨¨3↓¨⍳¨12⌊1+⍵-⍳⍵}

We first compute all the possible lengths for each starting point. For instance, the last element cannot have any (position, length) pair associated to it, because there is no three element following it. So we crop the possible lengths to \([3, 12]\). For instance for an array of size 10:

        {3↓¨⍳¨12⌊1+⍵-⍳⍵}10
┌──────────────┬───────────┬─────────┬───────┬─────┬───┬─┬┬┬┐
│4 5 6 7 8 9 10│4 5 6 7 8 9│4 5 6 7 8│4 5 6 7│4 5 6│4 5│4││││
└──────────────┴───────────┴─────────┴───────┴─────┴───┴─┴┴┴┘

Then, we just add the corresponding starting position to each length (1 for the first block, 2 for the second, and so on). Finally, we flatten everything.

∇ r←revp dna;positions
  positions←subarrays⍴dna
  ⍝ Filter subarrays which are reverse palindromes.
  r←↑({isrevp dna[¯1+⍵[1]+⍳⍵[2]]}¨positions)/positions
∇

For each possible (position, length) pair, we get the corresponding DNA substring with dna[¯1+⍵[1]+⍳⍵[2]] (adding ¯1 is necessary because ⎕IO←1). We test if this substring is a reverse palindrome using isrevp above. Replicate (/) then selects only the (position, length) pairs for which the substring is a reverse palindrome.

The second task is just about counting the number of subsets modulo 1,000,000. So we just need to compute \(2^n \mod 1000000\) for any positive integer \(n\leq1000\).

sset←{((1E6|2∘×)⍣⍵)1}

Since we cannot just compute \(2^n\) directly and take the remainder, we use modular arithmetic to stay mod 1,000,000 during the whole computation. The dfn (1E6|2∘×) doubles its argument mod 1,000,000. So we just apply this function \(n\) times using the Power operator (⍣), with an initial value of 1.

Problem 5 – Future and Present Value

First solution: ((1+⊢)⊥⊣) computes the total return for a vector of amounts ⍺ and a vector of rates ⍵. It is applied to every prefix subarray of amounts and rates to get all intermediate values. However, this has quadratic complexity.

rr←(,\⊣)((1+⊢)⊥⊣)¨(,\⊢)

Second solution: We want to be able to use the recurrence relation (recur) and scan through the vectors of amounts and rates, accumulating the total value at every time step. However, APL evaluation is right-associative, so a simple Scan (recur\amounts,¨values) would not give the correct result, since recur is not associative and we need to evaluate it left-to-right. (In any case, in this case, Scan would have quadratic complexity, so would not bring any benefit over the previous solution.) What we need is something akin to Haskell’s scanl function, which would evaluate left to right in \(O(n)\) timeThere is an interesting StackOverflow answer explaining the behaviour of Scan, and compares it to Haskell’s scanl function.

. This is what we do here, accumulating values from left to right. (This is inspired from dfns.ascan, although heavily simplified.)

rr←{recur←{⍵[1]+⍺×1+⍵[2]} ⋄ 1↓⌽⊃{(⊂(⊃⍵)recur⍺),⍵}/⌽⍺,¨⍵}

For the second task, there is an explicit formula for cashflow calculations, so we can just apply it.

pv←{+/⍺÷×\1+⍵}

Problem 6 – Merge

∇ text←templateFile Merge jsonFile;template;ns
  template←⊃⎕NGET templateFile 1
  ns←⎕JSON⊃⎕NGET jsonFile
  ⍝ We use a simple regex search and replace on the
  ⍝ template.
  text←↑('@[a-zA-Z]*@'⎕R{ns getval ¯1↓1↓⍵.Match})template
∇

We first read the template and the JSON values from their files. The ⎕NGET function read simple text files, and ⎕JSON extracts the key-value pairs as a namespace.

Assuming all variable names contain only letters, we match the regex @[a-zA-Z]*@ to match variable names enclosed between @ symbols. The function getval then returns the appropriate value, and we can replace the variable name in the template.

∇ val←ns getval var
  :If ''≡var ⍝ literal '@'
      val←'@'
  :ElseIf (⊂var)∊ns.⎕NL ¯2
      val←⍕ns⍎var
  :Else
      val←'???'
  :EndIf
∇

This function takes the namespace matching the variable names to their respective values, and the name of the variable.

If the variable name is empty, we matched the string @@, which corresponds to a literal @.
If the variable name is present in the namespace, we query the namespace to get the required value.
Otherwise, we have an unknown variable, so we replace it with ???.

Problem 7 – UPC

CheckDigit←{10|-⍵+.×11⍴3 1}

The check digit satisfies the equation \[ 3 x_{1}+x_{2}+3 x_{3}+x_{4}+3 x_{5}+x_{6}+3 x_{7}+x_{8}+3 x_{9}+x_{10}+3 x_{11}+x_{12} \equiv 0 \bmod 10, \] therefore, \[ x_{12} \equiv -(3 x_{1}+x_{2}+3 x_{3}+x_{4}+3 x_{5}+x_{6}+3 x_{7}+x_{8}+3 x_{9}+x_{10}+3 x_{11}) \bmod 10. \]

Translated to APL, we just take the dot product between the first 11 digits of the barcode with 11⍴3 1, negate it, and take the remainder by 10.

⍝ Left and right representations of digits. Decoding
⍝ the binary representation from decimal is more
⍝ compact than writing everything explicitly.
lrepr←⍉(7⍴2)⊤13 25 19 61 35 49 47 59 55 11
rrepr←~¨lrepr

For the second task, the first thing we need to do is save the representation of digits. To save space, I did not encode the binary representation explicitly, instead using a decimal representation that I then decode in base 2. The right representation is just the bitwise negation.

∇ bits←WriteUPC digits;left;right
  :If (11=≢digits)∧∧/digits∊0,⍳9
      left←,lrepr[1+6↑digits;]
      right←,rrepr[1+6↓digits,CheckDigit digits;]
      bits←1 0 1,left,0 1 0 1 0,right,1 0 1
  :Else
      bits←¯1
  :EndIf
∇

First of all, if the vector digits does not have exactly 11 elements, all between 0 and 9, it is an error and we return ¯1.

Then, we take the first 6 digits and encode them with lrepr, and the last 5 digits plus the check digit encoded with rrepr. In each case, adding 1 is necessary because ⎕IO←1. We return the final bit array with the required beginning, middle, and end guard patterns.

∇ digits←ReadUPC bits
  :If 95≠⍴bits ⍝ incorrect number of bits
      digits←¯1
  :Else
      ⍝ Test if the barcode was scanned right-to-left.
      :If 0=2|+/bits[3+⍳7]
          bits←⌽bits
      :EndIf
      digits←({¯1+lrepr⍳⍵}¨(7/⍳6)⊆42↑3↓bits),{¯1+rrepr⍳⍵}¨(7/⍳6)⊆¯42↑¯3↓bits
      :If ~∧/digits∊0,⍳9 ⍝ incorrect parity
          digits←¯1
      :ElseIf (⊃⌽digits)≠CheckDigit ¯1↓digits ⍝ incorrect check digit
          digits←¯1
      :EndIf
  :EndIf
∇

If we don’t have the correct number of bits, we return ¯1.
We test the first digit for its parity, to determine if its actually a left representation. If it’s not, we reverse the bit array.
Then, we take the bit array representing the right digits (¯42↑¯3↓bits), separate the different digits using Partition (⊆), and look up each of them in the rrepr vector using Index Of (⍳). We do the same for the left digits.
Final checks for the range of the digits (i.e., if the representations could not be found in the lrepr and rrepr vectors), and for the check digit.

Problem 8 – Balancing the Scales

∇ parts←Balance nums;subsets;partitions
  ⍝ This is a brute force solution, running in
  ⍝ exponential time. We generate all the possible
  ⍝ partitions, filter out those which are not
  ⍝ balanced, and return the first matching one. There
  ⍝ are more advanced approach running in
  ⍝ pseudo-polynomial time (based on dynamic
  ⍝ programming, see the "Partition problem" Wikipedia
  ⍝ page), but they are not warranted here, as the
  ⍝ input size remains fairly small.

  ⍝ Generate all partitions of a vector of a given
  ⍝ size, as binary mask vectors.
  subsets←{1↓2⊥⍣¯1⍳2*⍵}
  ⍝ Keep only the subsets whose sum is exactly
  ⍝ (+/nums)÷2.
  partitions←nums{((2÷⍨+/⍺)=⍺+.×⍵)/⍵}subsets⍴nums
  :If 0=≢,partitions
      ⍝ If no partition satisfy the above
      ⍝ criterion, we return ⍬.
      parts←⍬
  :Else
      ⍝ Otherwise, we return the first possible
      ⍝ partition.
      parts←nums{((⊂,(⊂~))⊃↓⍉⍵)/¨2⍴⊂⍺}partitions
  :EndIf
∇

Problem 9 – Upwardly Mobile

This is the only problem that I didn’t complete. It required parsing the files containing the graphical representations of the trees, which was needlessly complex and, quite frankly, hard and boring with a language like APL.

However, the next part is interesting: once we have a matrix of coefficients representing the relationships between the weights, we can solve the system of equations. Matrix Divide (⌹) will find one solution to the system. Since the system is overdetermined, we fix A=1 to find one possible solution. Since we want integer weights, the solution we find is smaller than the one we want, and may contain fractional weights. So we multiply everything by the Lowest Common Multiple (∧) to get the smallest integer weights.

∇ weights←Weights filename;mobile;branches;mat
  ⍝ Put your code and comments below here

  ⍝ Parse the mobile input file.
  mobile←↑⊃⎕NGET filename 1
  branches←⍸mobile∊'┌┴┐'
  ⍝ TODO: Build the matrix of coefficients mat.

  ⍝ Solve the system of equations (arbitrarily setting
  ⍝ the first variable at 1 because the system is
  ⍝ overdetermined), then multiply the coefficients by
  ⍝ their least common multiple to get the smallest
  ⍝ integer weights.
  weights←((1∘,)×(∧/÷))mat[;1]⌹1↓[2]mat
∇

    :EndNamespace
:EndNamespace

Table of Contents