How to pack / unpack a hex string (high nibble first) with Elixir

I'd like to know how to work with hex strings in Elixir. Specifically, I'm interested in converting from hex to ASCII.

In Ruby, one implementation might be:

["001C7F616A8B002128C1A33E8100"].pack('H*').gsub(/[^[:print:]]/, '.')

How would I accomplish this with Elixir? I tried:

<<00, 01, C7, F6...>>

But that is not a correct representation of the hex string. Thanks for your time and help!

So I've made some progress, but I'm currently struggling with the recursive aspect of this problem.

Here is my solution so far:

defmodule ElixirNetworkTools do
  def decode(payload) do
    upper_payload = String.upcase(payload)

    case Base.decode16(upper_payload) do
      :error -> decode_with_nonprintable_characters(upper_payload)
      {:ok, decoded_payload} -> decoded_payload
    end
    |> IO.write()
  end

  # Decode two hex digits at a time, substituting "." for any pair
  # that fails to decode.
  def decode_with_nonprintable_characters(payload) do
    ~r/\w{2}/
    |> Regex.scan(payload)
    |> List.flatten()
    |> Enum.map(fn byte ->
      case Base.decode16(byte) do
        :error -> "."
        {:ok, decoded_payload} -> decoded_payload
      end
    end)
  end
end

The final solution is below. If there is a cleaner or better one, I'd love to hear about it.

defmodule ElixirNetworkTools do
  @doc """
  The decode function takes a hexadecimal payload, such as one generated
  by Snort, and returns the ASCII representation of the string.

  ## Example
  iex> ElixirNetworkTools.decode("436F6E74656E742D4C656E6774683A203132")
  {:ok, "Content-Length: 12"}
  """
  def decode(payload) do
    case _validate_length_of_snort(payload) do
      :error -> raise "Invalid length hex string. Must be even length. Exiting"
      _ -> nil
    end

    decoded = String.upcase(payload)
    |> _do_decode
    |> to_string

    {:ok, decoded}
  end

  @doc """
  Internal function used to manually process the hexadecimal payload,
  and builds a char list of the printable characters. If a character is
  not printable, we instead use periods.

  ## Example
  iex> ElixirNetworkTools._do_decode("436F6E74656E742D4C656E6774683A203132")
  ["Content-Length: 12"]
  """
  def _do_decode(payload) do
    Base.decode16!(payload)
    |> String.chunk(:printable)
    |> Enum.map(fn(chunk) ->
        case String.printable? chunk do
          true -> chunk
          _ -> "."
        end
    end)
  end

  @doc """
  Internal function used to validate the length of the hexadecimal payload.
  Hexadecimal strings should have an even number of characters.

  ## Example
  iex> ElixirNetworkTools._validate_length_of_snort("436F6E74656E742D4C656E6774683A203132")
  :ok
  """
  def _validate_length_of_snort(payload) do
    String.length(payload)
    |> rem(2)
    |> case do
      0 -> :ok
      _ -> :error
    end
  end
end

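Here is another solution to the problem. A couple of things before we start:

  • You can pass case: :mixed to Base.decode16/2, as in Base.decode16(string, case: :mixed), so you don't need to upcase beforehand.

  • If you are going to raise on an invalid string anyway, don't bother checking the length yourself; just call decode16, since it checks the size too.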
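
The two points above can be checked quickly in a script or in iex. This is only a small sketch; the sample strings are made up for illustration, and each line is a pattern match that fails loudly if the result differs:

```elixir
# Lowercase digits are rejected by default (the default is case: :upper)...
:error = Base.decode16("436f")

# ...but they are accepted with case: :mixed.
{:ok, "Co"} = Base.decode16("436f", case: :mixed)

# An odd-length string is rejected as well, so no separate length check is needed.
:error = Base.decode16("436F6", case: :mixed)
```

The bang variant, Base.decode16!/2, raises an ArgumentError in the cases where decode16/2 returns :error.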

This means we can start with:
decoded = Base.decode16!(string, case: :mixed)

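Now you need to deal with the non-printable characters. Don't use String.printable?/1, because it is about UTF-8, not ASCII. We need to implement our own function, but which makes more sense: raising on them or replacing them? If someone sends invalid data, it seems it should be treated as an error. If so: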

def validate_ascii!(<<h, t::binary>>) when h <= 127 do
  validate_ascii!(t)
end

def validate_ascii!(<<>>) do
  true
end

def validate_ascii!(rest) do
  raise "invalid ascii on string starting at: #{rest}"
end

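Or you can just remove the last clause, and it will fail as well (with a FunctionClauseError instead of the custom message).

Now we can put it together: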

decoded = Base.decode16!(string, case: :mixed)
validate_ascii!(decoded)
decoded
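
For reference, here is how those pieces might look as one self-contained module. This is a sketch only: the module name HexAsciiStrict and the function name decode!/1 are hypothetical, and the error message uses inspect/1 (a small deviation from the clause above) so that non-UTF-8 bytes show up readably.

```elixir
defmodule HexAsciiStrict do
  # Hypothetical module name, used only for this sketch.

  # Decode a hex string (upper or lower case) and raise unless every
  # decoded byte is 7-bit ASCII.
  def decode!(string) do
    decoded = Base.decode16!(string, case: :mixed)
    validate_ascii!(decoded)
    decoded
  end

  defp validate_ascii!(<<h, t::binary>>) when h <= 127, do: validate_ascii!(t)
  defp validate_ascii!(<<>>), do: true

  defp validate_ascii!(rest) do
    # inspect/1 is used here so that invalid bytes are printed safely.
    raise "invalid ascii on string starting at: #{inspect(rest)}"
  end
end

IO.puts(HexAsciiStrict.decode!("436f6e74656e742d4c656e6774683a203132"))
# prints "Content-Length: 12"
```

Note that the payload from the question contains bytes above 127 (for example 8B), so this strict version raises on it.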

Edit: if you need to replace non-ASCII bytes with dots:

def keep_ascii(<<h, t::binary>>, acc) when h <= 127 do
  keep_ascii(t, acc <> <<h>>)
end

def keep_ascii(<<_, t::binary>>, acc) do
  keep_ascii(t, acc <> ".")
end

def keep_ascii(<<>>, acc) do
  acc
end
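
keep_ascii/2 is called with an empty binary as the initial accumulator, e.g. keep_ascii(decoded, ""). Note that the h <= 127 guard still lets ASCII control bytes such as 0x00 or 0x7F through, whereas the Ruby one-liner in the question replaces everything that is not printable. If that behaviour is what you want, one possible variation is to narrow the guard to the printable range. The sketch below assumes "printable" means 0x20 (space) through 0x7E (~); the module and function names are hypothetical.

```elixir
defmodule HexAsciiDots do
  # Hypothetical module and function names, used only for this sketch.

  # Decode the hex string, then replace every non-printable byte with ".".
  def to_printable(hex) do
    hex
    |> Base.decode16!(case: :mixed)
    |> keep_printable("")
  end

  # Assumption: printable ASCII is 0x20 (space) through 0x7E (~).
  defp keep_printable(<<h, t::binary>>, acc) when h in 0x20..0x7E do
    keep_printable(t, acc <> <<h>>)
  end

  # Any other byte (control characters and bytes above 0x7E) becomes a dot.
  defp keep_printable(<<_, t::binary>>, acc) do
    keep_printable(t, acc <> ".")
  end

  defp keep_printable(<<>>, acc), do: acc
end

IO.puts(HexAsciiDots.to_printable("001C7F616A8B002128C1A33E8100"))
# prints "...aj..!(..>.."
```

Each input byte maps to exactly one output character here, so the result always has half as many characters as the hex string.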