Ruby: Can I write multi-line string with no concatenation?
Asked Answered
A

16

504

Is there a way to make this look a little better?

conn.exec 'select attr1, attr2, attr3, attr4, attr5, attr6, attr7 ' +
          'from table1, table2, table3, etc, etc, etc, etc, etc, ' +
          'where etc etc etc etc etc etc etc etc etc etc etc etc etc'

Like, is there a way to imply concatenation?

Abbate answered 25/2, 2010 at 20:48 Comment(0)
H
705

There are pieces to this answer that helped me get what I needed (easy multi-line concatenation WITHOUT extra whitespace), but since none of the actual answers had it, I'm compiling them here:

str = 'this is a multi-line string'\
  ' using implicit concatenation'\
  ' to prevent spare \n\'s'

=> "this is a multi-line string using implicit concatenation to eliminate spare
\\n's"

As a bonus, here's a version using funny HEREDOC syntax (via this link):

p <<END_SQL.gsub(/\s+/, " ").strip
SELECT * FROM     users
         ORDER BY users.id DESC
END_SQL
# >> "SELECT * FROM users ORDER BY users.id DESC"

The latter would mostly be for situations that required more flexibility in the processing. I personally don't like it, it puts the processing in a weird place w.r.t. the string (i.e., in front of it, but using instance methods that usually come afterward), but it's there. Note that if you are indenting the last END_SQL identifier (which is common, since this is probably inside a function or module), you will need to use the hyphenated syntax (that is, p <<-END_SQL instead of p <<END_SQL). Otherwise, the indenting whitespace causes the identifier to be interpreted as a continuation of the string.

This doesn't save much typing, but it looks nicer than using + signs, to me.

Also (I say in an edit, several years later), if you're using Ruby 2.3+, the operator <<~ is also available, which removes extra indentation from the final string. You should be able to remove the .gsub invocation, in that case (although it might depend on both the starting indentation and your final needs).

EDIT: Adding one more:

p %{
SELECT * FROM     users
         ORDER BY users.id DESC
}.gsub(/\s+/, " ").strip
# >> "SELECT * FROM users ORDER BY users.id DESC"
Harmonize answered 17/5, 2011 at 15:6 Comment(7)
This is an old question BUT there either is an error in the answer or has been a change in syntax since then. p <<END_SQL should be p <<-END_SQL Otherwise this is The Answer. optionally you can strip leading whitespace with the squiggly HEREDOC operator, <<~END_SQLVatic
It's only an error if the ending identifier is indented (the hyphen tells the ruby interpreter to trim whitespace before making the ending identifier determination). I can put a note mentioning that, though. Also, the ~ is unnecessary, gsub \s+ and strip are already removing leading whitespace.Harmonize
Adding <<~ to the answer would be nice, ended up researching that from there. Personnally, I use <<~MSG.strip ... MSG which also strips the last \n.Semipermeable
When I wrote this answer (nine years ago, sheesh!), Ruby was on 1.9, and <<~ (evidently) wasn't introduced until 2.3. Anyway, ancient history aside, I'll put it in, thanks for bringing it up.Harmonize
Thank you for being one of the few answers which doesn't add extra newlines, which is what I was trying to avoid when I found this question.Target
This solution causes me FrozenError: can't modify frozen StringChukar
.gsub and .strip mutate their targets, and frozen strings are immutable; this probably means you're using the frozen_string_literal pragma in a Rails 2.3+ project (and doing this operation to a literal string). Using one of the other answers that relies on concatenation rather than mutation should work for you (removing extra whitespaces will still require un-freezing the strings, though).Harmonize
P
227

In ruby 2.0 you can now just use %

For example:

    SQL = %{
      SELECT user, name
      FROM users
      WHERE users.id = #{var}
      LIMIT #{var2}
    }
Phenylketonuria answered 28/8, 2013 at 1:33 Comment(7)
Works in Ruby 1.9.3 too.Krug
A string created with this syntax will include both newlines and any indention added to subsequent lines.Straddle
This is even better than <<EOT ......EOT (here document)! it also does interpolation if needed.Stephen
@Stephen A heredoc does interpolation as well.Hives
If using Rails invoking squish on the output should be helpful.Cq
The thing to note is that many commenters are talking about stripping/compacting commas. If you do so, then do not interpolate variables in there. Because it will obviously compact white space in your variables. Bad idea to use interpolation anyway because of injection. Use your driver placeholder system instead which escapes the variables and possibly formats them for each type.Myrmecology
speak correclty. it is not % but %{}Toddy
A
181

Yes, if you don't mind the extra newlines being inserted:

 conn.exec 'select attr1, attr2, attr3, attr4, attr5, attr6, attr7
            from table1, table2, table3, etc, etc, etc, etc, etc,
            where etc etc etc etc etc etc etc etc etc etc etc etc etc'

Alternatively you can use a heredoc:

conn.exec <<-eos
   select attr1, attr2, attr3, attr4, attr5, attr6, attr7
   from table1, table2, table3, etc, etc, etc, etc, etc,
   where etc etc etc etc etc etc etc etc etc etc etc etc etc
eos
Appendicle answered 25/2, 2010 at 20:50 Comment(6)
You could also use %Q(...)Triforium
@Zombies: Newlines are typically allowed in SQL statements and are just treated as ordinary whitespace.Appendicle
see my answer below for an example, you can just use % now.Phenylketonuria
You could also use %(...)Aquinas
Something important to keep in mind if you intentionally add trailing whitespace and use one of these solutions is that your editor may automatically remove trailing space when saving the file. While I normally prefer this behaviour, it has caused unexpected issues for me a few times. A solution is to write your multi-line string like how the OP did in the question.Tay
"if you don't mind the extra newlines being inserted" but the question asks for no newlines...Target
F
70

This question made me go off a rabbit hole to understand how HEREDOC works. Excuse me if the answer became too long.

The squiggly HEREDOC <<~ is what you are looking for when you want to define a multi-line string with newlines and proper indentation (available since Ruby 2.3):

conn.exec <<~EOS
            select attr1, attr2, attr3, attr4, attr5, attr6, attr7
            from table1, table2, table3, etc, etc, etc, etc, etc
            where etc etc etc etc etc etc etc etc etc etc etc etc etc
          EOS

# -> "select...\nfrom...\nwhere..."

If proper indentation is not a concern, then single and double quotes can span multiple lines in Ruby:

conn.exec "select attr1, attr2, attr3, attr4, attr5, attr6, attr7 
           from table1, table2, table3, etc, etc, etc, etc, etc, 
           where etc etc etc etc etc etc etc etc etc etc etc etc etc"    

# -> "select...\n           from...\n           where..."
      

If single or double quotes are cumbersome because that would need lots of escaping, then the percent string literal notation % is the most flexible solution:

conn.exec %(select attr1, attr2, attr3, attr4, attr5, attr6, attr7
            from table1, table2, table3, etc, etc, etc, etc, etc
            where (ProductLine = 'R' OR ProductLine = "S") AND Country = "...")
# -> "select...\n            from...\n            where..."

If the aim is to avoid the newlines (which both the squiggly HEREDOC, quotes and the percent string literal will cause), then a line continuation can be used by putting a backslash \ as the last non-whitespace character in a line. This will continue the line and will cause Ruby to concatenate the Strings back to back (watch out for those spaces inside the quoted string):

conn.exec 'select attr1, attr2, attr3, attr4, attr5, attr6, attr7 ' \
          'from table1, table2, table3, etc, etc, etc, etc, etc, ' \
          'where etc etc etc etc etc etc etc etc etc etc etc etc etc'

# -> "select...from...where..."

If you use Rails, then String.squish will strip the string of leading and trailing space and collapse all consecutive whitespaces (newlines, tabs, and all) into a single space:

conn.exec "select attr1, attr2, attr3, attr4, attr5, attr6, attr7 
           from table1, table2, table3, etc, etc, etc, etc, etc, 
           where etc etc etc etc etc etc etc etc etc etc etc etc etc".squish

# -> "select...attr7 from...etc, where..."

More details:

Ruby HEREDOC Syntax

The Here Document Notation for Strings is a way to designate long blocks of text inline in code. It is started by << followed by a user-defined String (the End of String terminator). All following lines are concatenated until the End of String terminator is found at the very beginning of a line:

puts <<HEREDOC 
Text Text Text Text
Bla Bla
HEREDOC
# -> "Text Text Text Text\nBlaBla"

The End of String terminator can be chosen freely, but it is common to use something like "EOS" (End of String) or something that matches the domain of the String such as "SQL".

HEREDOC supports interpolation by default or when the EOS terminator is double quoted:

price = 10
print <<"EOS"  # comments can be put here
1.) The price is #{price}.
EOS
# -> "1.) The price is 10."

Interpolation can be disabled if the EOS terminator is single quoted:

print <<'EOS' # Disabled interpolation
3.) The price is #{price}.
EOS
# -> "3.) The price is #{price}."

One important restriction of the <<HEREDOC is that the End of String terminator needs to be at the beginning of the line:

  puts <<EOS 
    def foo
      print "foo"
    end
  EOS
EOS
#-> "....def foo\n......print "foo"\n....end\n..EOS"

To get around this, the <<- syntax was created. It allows the EOS terminator to be indented to make the code look nicer. The lines between the <<- and EOS terminator are still used in their full extend including all indentation:

def printExample
  puts <<-EOS # Use <<- to indent End of String terminator
    def foo
      print "foo"
    end
  EOS
end
# -> "....def foo\n......print "foo"\n....end"

Since Ruby 2.3, we now have the squiggly HEREDOC <<~ which removes leading whitespace:

puts <<~EOS # Use the squiggly HEREDOC <<~ to remove leading whitespace (since Ruby 2.3!)
  def foo
    print "foo"
  end
EOS
# -> "def foo\n..print "foo"\nend"

Empty lines and lines which only contains tabs and space are ignored by <<~

puts <<~EOS.inspect 
  Hello

    World!
EOS
#-> "Hello\n..World!"

If both tabs and spaces are used, tabs are considered as equal to 8 spaces. If the least-indented line is in the middle of a tab, this tab is not removed.

puts <<~EOS.inspect
<tab>One Tab
<space><space>Two Spaces
EOS
# -> "\tOne Tab\nTwoSpaces"

HEREDOC can do some crazy stuff such as executing commands using backticks:

puts <<`EOC`            
echo #{price}
echo #{price * 2}
EOC

HEREDOC String definitions can be "stacked", which means that the first EOS terminator (EOSFOO below) will end the first string and start the second (EOSBAR below):

print <<EOSFOO, <<EOSBAR    # you can stack them
I said foo.
EOSFOO
I said bar.
EOSBAR

I don't think anybody would ever use it as such, but the <<EOS is really just a string literal and can be put whereever a string can normally be put:

def func(a,b,c)
  puts a
  puts b
  puts c
end

func(<<THIS, 23, <<THAT) 
Here's a line
or two.
THIS
and here's another.
THAT

If you don't have Ruby 2.3, but Rails >= 3.0 then you can use String.strip_heredoc which does the same as <<~

# File activesupport/lib/active_support/core_ext/string/strip.rb, line 22
class String
  def strip_heredoc
    gsub(/^#{scan(/^[ \t]*(?=\S)/).min}/, "".freeze)
  end
end

puts <<-USAGE.strip_heredoc # If no Ruby 2.3, but Rails >= 3.0
  This command does such and such.

  Supported options are:
    -h         This message
    ...
USAGE

Troubleshooting

If you see errors when Ruby parses your file, then it is most likely that you either have extra leading or trailing spaces with a HEREDOC or extra trailing spaces with a squiggly HEREDOC. For example:

What you see:

    database_yml = <<~EOS
      production:
        database: #{fetch(:user)}
        adapter: postgresql
        pool: 5
        timeout: 5000
    EOS  

What Ruby tells you:

SyntaxError: .../sample.rb:xx: can't find string "EOS" anywhere before EOF
...sample.rb:xx: syntax error, unexpected end-of-input, expecting `end'

What is at fault:

Spot the extra spaces after the terminating EOS

Spot the extra spaces after the terminating EOS.

Percent String Literals

See RubyDoc for how to use the percentage string literals. Percentage string literals start with a percentage sign % followed either an opening parenthesis or any non-alphanumeric character. Then you put your string and end the string literal with the associated closing parenthesis symbol or the non-alphanumeric character.

Thus you could use %(...), %[...], %{...} or e.g. %+...+.

No matter which symbol or parentheses (round, square, curly) the result is always the same: Just a string literal.

You simply chose the terminators that are the least likely to be used in the string inside, reducing the need to do escaping. For example, in SPARQL queries {} and () and [] are all used, so it might make sense to use %$...$. In SQL, [] and () are common, thus %{} could be a good pick.

Last Words

Last, to get the answer to the original question "Is there a way to imply concatenation?" answered: Ruby always implies concatenation if two strings (single and double quoted) are found back to back:

puts "select..." 'from table...' "where..."
# -> "select...from table...where..."

The caveat is that this does not work across line-breaks, because Ruby is interpreting an end of statement and the consequitive line of just strings alone on a line doesn't do anything.

Furuncle answered 19/3, 2020 at 21:13 Comment(2)
This is the proper answer. Thanks for all the details! It clarified a lot of things with HEREDOC.Hanni
I think it will be obvious for many, but took me a while to understand that there is no difference between how %{}, %(), %[] strings are processed. You simply chose the terminators that are the least likely to be used in the string inside, removing (or almost eliminating) the need to do any escaping. For example, in SPARQL queries {} and () are common, and [] could be used for lists, it makes sense to use %$$. In SQL, [] and () are common, %{} could be a good pick (except for T-SQL, maybe). I also like to pick chars that one needs to escape in Bash, \$ looks natural.Orleans
R
51

There are multiple syntaxes for multi-line strings as you've already read. My favorite is Perl-style:

conn.exec %q{select attr1, attr2, attr3, attr4, attr5, attr6, attr7
      from table1, table2, table3, etc, etc, etc, etc, etc,
      where etc etc etc etc etc etc etc etc etc etc etc etc etc}

The multi-line string starts with %q, followed by a {, [ or (, and then terminated by the corresponding reversed character. %q does not allow interpolation; %Q does so you can write things like this:

conn.exec %Q{select attr1, attr2, attr3, attr4, attr5, attr6, attr7
      from #{table_names},
      where etc etc etc etc etc etc etc etc etc etc etc etc etc}

I actually have no idea how these kinds of multi-line strings are called so let's just call them Perl multilines.

Note however that whether you use Perl multilines or heredocs as Mark and Peter have suggested, you'll end up with potentially unnecessary whitespaces. Both in my examples and their examples, the "from" and "where" lines contain leading whitespaces because of their indentation in the code. If this whitespace is not desired then you must use concatenated strings as you are doing now.

Rauwolfia answered 26/2, 2010 at 9:13 Comment(3)
from #{table_names} would not work in this example, as you used %q{}, it would work if you used %q[] or ()Dab
My favorite in this vein is just %{ super multiline string with interpolation support }Oogenesis
strings produced from the %q family will include the newlines which is not equivilent to the original code.Target
R
31

Sometimes is worth to remove new line characters \n like:

conn.exec <<-eos.squish
 select attr1, attr2, attr3, attr4, attr5, attr6, attr7
 from table1, table2, table3, etc, etc, etc, etc, etc,
 where etc etc etc etc etc etc etc etc etc etc etc etc etc
eos
Ruche answered 16/7, 2014 at 6:57 Comment(1)
this is rails based not rubyHighsounding
S
26

You can also use double quotes

x = """
this is 
a multiline
string
"""

2.3.3 :012 > x
 => "\nthis is\na multiline\nstring\n"

If needed to remove line breaks "\n" use backslash "\" at the end of each line

Sundaysundberg answered 29/3, 2017 at 15:39 Comment(8)
You can achieve the same result with the singular double quotes. There is no such thing like triple double quotes in Ruby. It just interprets them as "" + "double quotes with some content" + "".Wentz
Yeah, but ` ""+"\n hello\n "+"" Does look weirdSundaysundberg
Yes, it looks weird, and this is why there is no reason to add extra double quotes when you can just use the singular double quotes with the same result.Wentz
Yes, I meant the plus sign. The double quotes without it look just fine, its readable and and easier to spot instead of a single quote, which should be used on single line strings.Sundaysundberg
I mean that just "x" looks better and works faster than """x""" (which is basically the same as ""+"x"+"") or """""x""""" (which is the same as "" + "" + "x" + "" + ""). It's Ruby, not Python, where you use """ instead of " when you need a multi-line string.Wentz
double quotes will include the newlines which is not equivalent to the original code.Target
@Target that's why I put a not at the end to use `\` if no new lines are needed. You didn't need to downvote.Sundaysundberg
Apologies, I missed that while scouring this page for an answer which did what I needed.Target
L
17

Other options:

#multi line string
multiline_string = <<EOM
This is a very long string
that contains interpolation
like #{4 + 5} \n\n
EOM

puts multiline_string

#another option for multiline string
message = <<-EOF
asdfasdfsador #{2+2} this month.
asdfadsfasdfadsfad.
EOF

puts message
Leifleifer answered 28/12, 2016 at 19:3 Comment(3)
Should change <<EOM to <<-EOM, no?Gittel
Maybe, it seemed to work for my <<-EOF example. My guess is that either way works.Leifleifer
heredoc will include the newlines which is not equivalent to the original code.Target
B
16

Recently with the new features in Ruby 2.3 the new squiggly HEREDOC will let you write our multiline strings in a nice manner with a minimal change so using this combined with the .squish (if you are using rails) will let you write multiline in a nice way! in case of just using ruby, you can do a <<~SQL.split.join(" ") which is almost the same

[1] pry(main)> <<~SQL.squish
[1] pry(main)*   select attr1, attr2, attr3, attr4, attr5, attr6, attr7
[1] pry(main)*   from table1, table2, table3, etc, etc, etc, etc, etc,
[1] pry(main)*   where etc etc etc etc etc etc etc etc etc etc etc etc etc
[1] pry(main)* SQL
=> "select attr1, attr2, attr3, attr4, attr5, attr6, attr7 from table1, table2, table3, etc, etc, etc, etc, etc, where etc etc etc etc etc etc etc etc etc etc etc etc etc"

ref: https://infinum.co/the-capsized-eight/multiline-strings-ruby-2-3-0-the-squiggly-heredoc

Barby answered 1/10, 2017 at 11:42 Comment(1)
squish is rails, not rubyTarget
C
15
conn.exec = <<eos
  select attr1, attr2, attr3, attr4, attr5, attr6, attr7
  from table1, table2, table3, etc, etc, etc, etc, etc,
  where etc etc etc etc etc etc etc etc etc etc etc etc etc
eos
Corelation answered 25/2, 2010 at 20:51 Comment(2)
using heredoc without the '-', as in '<<-eos', will include the additional leader spaces. see Mark Byers' response.Lingo
heredoc will include the newlines which is not equivalent to the original code.Target
D
9

To avoid closing the parentheses for each line you can simply use double quotes with a backslash to escape the newline:

"select attr1, attr2, attr3, attr4, attr5, attr6, attr7 \
from table1, table2, table3, etc, etc, etc, etc, etc, \
where etc etc etc etc etc etc etc etc etc etc etc etc etc"
Dardar answered 15/6, 2016 at 16:34 Comment(1)
This is one of the few answers on this page which actually answers the question!Target
D
7

Elegant Answer Today:

<<~TEXT
Hi #{user.name}, 

Thanks for raising the flag, we're always happy to help you.
Your issue will be resolved within 2 hours.
Please be patient!

Thanks again,
Team #{user.organization.name}
TEXT

Theres a difference in <<-TEXT and <<~TEXT, former retains the spacing inside block and latter doesn't.

There are other options as well. Like concatenation etc. but this one makes more sense in general.

If I am wrong here, let me know how...

Dermatophyte answered 6/2, 2020 at 13:41 Comment(1)
heredoc will include the newlines which is not equivalent to the original code.Target
A
6
conn.exec 'select attr1, attr2, attr3, attr4, attr5, attr6, attr7 ' <<
        'from table1, table2, table3, etc, etc, etc, etc, etc, ' <<
        'where etc etc etc etc etc etc etc etc etc etc etc etc etc'

<< is the concatenation operator for strings

Apprehension answered 25/2, 2010 at 20:52 Comment(2)
+ is the regular concatenation operator, << is the in-place append operator. Using side effects on a literal happens to work here (the first string is modified twice and returned) but IMHO it's weird and makes me do a double-take, where + would be perfectly clear. But maybe I'm just new to Ruby...Doff
This won't work if frozen_string_literal is enabledEmpathize
A
6

If you do mind extra spaces and newlines, you can use

conn.exec %w{select attr1, attr2, attr3, attr4, attr5, attr6, attr7
  from table1, table2, table3, etc, etc, etc, etc, etc,
  where etc etc etc etc etc etc etc etc etc etc etc etc etc} * ' '

(use %W for interpolated strings)

Alita answered 12/3, 2013 at 22:9 Comment(2)
I like this one very much because it allows much more combinations of usage.Kunkle
This will squish multiple adjacent spaces into one. (Its squishing of newline+following indentation is a win here but in middle of line it might be surprising.)Doff
W
5
conn.exec [
  "select attr1, attr2, attr3, ...",
  "from table1, table2, table3, ...",
  "where ..."
].join(' ')

This suggestion has the advantage over here-documents and long strings that auto-indenters can indent each part of the string appropriately. But it comes at an efficiency cost.

Wooded answered 25/2, 2010 at 20:56 Comment(2)
@Aidan, You can replace the commas with backslashes (a la C) and no join (or array) will be needed: The interpreter will concatenate the strings at (I think) parse time, making it pretty quick compared to most of the alternatives. One advantage, though, of joining an array of strings is that some auto-indenters do nicer work than they do with, for example, here-doc strings or with \.Thermoscope
One note, the heredoc syntax <<- will allow appropriate indentation.Harmonize
T
4

Like you, I was also looking for a solution which does not include newlines. (While they may be safe in SQL, they're not safe in my case and I have a large block of text to deal with)

This is arguably just as ugly, but you can backslash-escape newlines in a heredoc to omit them from the resulting string:

conn.exec <<~END_OF_INPUT
    select attr1, attr2, attr3, attr4, attr5, attr6, attr7 \
    from table1, table2, table3, etc, etc, etc, etc, etc, \
    where etc etc etc etc etc etc etc etc etc etc etc etc etc
  END_OF_INPUT

Note that you cannot due this without interpolation (I.E. <<~'END_OF_INPUT') so be careful. #{expressions} will be evaluated here, whereas they will not in your original code. A. Wilson's answer may be better for that reason.

Target answered 14/5, 2020 at 22:34 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.