Haskell: Function to determine the arity of functions?

Is it possible to write a function arity :: a -> Integer to determine the arity of arbitrary functions, such that

> arity map
2
> arity foldr
3
> arity id
1
> arity "hello"
0

?

Hemingway asked 3/12, 2011 at 16:32 Comment(3)

I believe it is possible using clever tricks with the type system. Search for variadic or polyvariadic functions in Haskell. – Pneumograph 3/12, 2011 at 16:45

I think this is an interesting question, and I'm amazed by max taldykin's answer, but I do wonder - what would you use such a function for? – Misogynist 21/9, 2012 at 15:19

@Frerich At the time of this question I was reading Elements of Programming by Stepanov and McJones where they introduced the type attribute Arity(F) that returns the number of inputs of F. I was curious if I could implement some of the functions they defined in Haskell. – Hemingway 23/9, 2012 at 10:47

It's easy with OverlappingInstances:

{-# LANGUAGE FlexibleInstances, OverlappingInstances #-}

class Arity f where
  arity :: f -> Int

instance Arity x where
  arity _ = 0

instance Arity f => Arity ((->) a f) where
  arity f = 1 + arity (f undefined)

Upd: Found a problem. You need to specify a non-polymorphic type for polymorphic functions:

arity (foldr :: (a -> Int -> Int) -> Int -> [a] -> Int)

Don't know how to solve this yet.

Upd2: As Sjoerd Visscher commented below, "you have to specify a non-polymorphic type, as the answer depends on which type you choose".
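For readers on a newer compiler: the OverlappingInstances extension has been deprecated since GHC 7.10 in favour of per-instance pragmas. The following is a minimal sketch of the same class using an OVERLAPPABLE pragma on the catch-all instance, assuming a reasonably recent GHC. It is not a replacement for the code above, only a restatement of it, and the type annotations in main are illustrative choices. It also shows the point from the second update: the result depends on which monomorphic type is picked.

```haskell
{-# LANGUAGE FlexibleInstances #-}

module Main where

class Arity f where
  arity :: f -> Int

-- Catch-all instance: anything that is not a function has arity 0.
-- OVERLAPPABLE lets the more specific function instance win when both match.
instance {-# OVERLAPPABLE #-} Arity x where
  arity _ = 0

-- A function contributes 1 plus the arity of its result type.
-- The argument is never forced, so passing undefined is safe here.
instance Arity f => Arity (a -> f) where
  arity f = 1 + arity (f undefined)

main :: IO ()
main = do
  print (arity "hello")                                                 -- 0
  print (arity (id :: Int -> Int))                                      -- 1
  print (arity (map :: (Int -> Bool) -> [Int] -> [Bool]))               -- 2
  print (arity (foldr :: (a -> Int -> Int) -> Int -> [a] -> Int))       -- 3
  -- Same function, different instantiation, different answer:
  print (arity (id :: (Int -> Int) -> Int -> Int))                      -- 2
  print (arity (foldr :: (a -> (Int -> Int) -> Int -> Int)
                      -> (Int -> Int) -> [a] -> Int -> Int))            -- 4
```

The last two lines are why a fully polymorphic argument such as plain foldr cannot be given a single arity: the result type may itself be instantiated to a function.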

Causerie answered 3/12, 2011 at 16:55 Comment(10)

What do we need OverlappingInstances for? – Pneumograph 3/12, 2011 at 17:0

@scravy, instance Arity x is more general than instance Arity ((->) a f), so without extensions GHC can't choose which of these two instances to use for functions. OverlappingInstances tells GHC that a) such instances are allowed; b) it needs to choose the most specific one. – Causerie 3/12, 2011 at 17:5

@max Does not work for me: arity const gives me Ambiguous type variable 'a0' in the constraint: (Arity a0) arising from a use of 'arity' – Hemingway 3/12, 2011 at 17:9

It makes sense that you have to specify a non-polymorphic type, as the answer depends on which type you choose, e.g.: arity (foldr :: (a -> (Int -> Int) -> Int -> Int) -> (Int -> Int) -> [a] -> Int -> Int) – Ingunna 3/12, 2011 at 17:35

Ok after some playing around, the solution is to add the IncoherentInstances LANGUAGE pragma ;) – Oolite 3/12, 2011 at 19:15

@Oolite - nice to know there's an actual use for Incoherent Instances :) And +1 for this cool answer, it's awesome to see arity foldr produce 3 in ghci. – Nephelinite 3/12, 2011 at 21:58

@Oolite - buuuut IncoherentInstances gets confused with lambda expressions. arity $ \x y -> 3 produces 0 with Incoherent Instances, 2 without. Perhaps this is an area where IncoherentInstances could be improved? – Nephelinite 3/12, 2011 at 22:7

@DanBurton I'm not really sure what's happening here, but I think lambda expressions have a different internal representation than normal functions, which is causing this. They might be optimised somehow. For example, arity (\x y z -> (x,y,z)) is 3, arity (\x y z -> x) is 0, arity (\x y z -> (x,y)) is 2 and arity (\x y z -> (x,z)) is surprisingly 1. What's to be noticed here is that if all variables on the left side occur on the right side then the result is correct; however, if not all of them occur on the right side, the result does not make sense. We need some GHC expert :D – Oolite 3/12, 2011 at 22:32

The confusion for lambdas does not require IncoherentInstances. It is confused by default. For example, arity (\x -> x) gives 0. In fact this happens whenever the lambda doesn't do something useful to the parameter. If you do something useful, like arity (\x -> x + 1), you get 1. If you do something useful to 2 parameters (like adding them up), you get 2. I think this has something to do with laziness: if the parameter is not used in a useful way, it isn't counted towards the arity. – Zurheide 21/8, 2015 at 12:30

@Zurheide Sounds like a compiler optimisation is to blame for that 😊 – Blockhouse 23/8, 2015 at 12:51

Yes, it can be done very, very easily:

arity :: (a -> b) -> Int
arity = const 1

Rationale: If it is a function, you can apply it to exactly 1 argument. Note that Haskell syntax makes it impossible to apply a function to 0, 2 or more arguments, as f a b is really (f a) b, i.e. not f applied to a and b, but (f applied to a) applied to b. The result may, of course, be another function that can be applied again, and so forth.

Sounds stupid, but is nothing but the truth.
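To make the rationale concrete, here is a small sketch of that one-argument-at-a-time reading. The function add3 and the names step1 and step2 are made up for illustration; nothing here goes beyond ordinary currying.

```haskell
module Main where

-- Hypothetical three-"argument" function; its type really reads
-- Int -> (Int -> (Int -> Int)).
add3 :: Int -> Int -> Int -> Int
add3 x y z = x + y + z

main :: IO ()
main = do
  let step1 = add3 1      -- step1 :: Int -> Int -> Int
      step2 = step1 2     -- step2 :: Int -> Int
  print (step2 3)                          -- 6
  -- Juxtaposition associates to the left, so both sides are the same expression.
  print (add3 1 2 3 == ((add3 1) 2) 3)     -- True
```

Each application consumes exactly one argument and hands back either a value or another function.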

Decomposition answered 3/12, 2011 at 17:57 Comment(2)

+1 for stupid truth. All Haskell functions have arity 1, because it's OK for functions to produce functions. a -> b -> c is just sugar for a -> (b -> c). – Nephelinite 3/12, 2011 at 21:53

Alright then, is it possible to recursively find the depth of a tree of functions? – Muskrat 5/12, 2011 at 7:38
