To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 辿多旦達辿多旦達[辿多旦達辿多旦達[^ 1001001001001000100100011011110110010010010101011001001001000010100100100100100010010001101111011001001001010101100100100100001001011011100100100100100010010001101111011001001001010101100100100100001010010010010010001001000110111101100100100101010110010010010000100101101101011110 924891bd92559242924891bd925592425b924891bd92559242924891bd925592425b5e
EUC-JP 辿多旦達辿多旦達[辿多旦達辿多旦達[^ 1100001110101001110000101011111111000011101101101100001110100011110000111010100111000010101111111100001110110110110000111010001101011011110000111010100111000010101111111100001110110110110000111010001111000011101010011100001010111111110000111011011011000011101000110101101101011110 c3a9c2bfc3b6c3a3c3a9c2bfc3b6c3a35bc3a9c2bfc3b6c3a3c3a9c2bfc3b6c3a35b5e
UTF-8 辿多旦達辿多旦達[辿多旦達辿多旦達[^ 111010001011111010111111111001011010010010011010111001101001011110100110111010011000000110010100111010001011111010111111111001011010010010011010111001101001011110100110111010011000000110010100010110111110100010111110101111111110010110100100100110101110011010010111101001101110100110000001100101001110100010111110101111111110010110100100100110101110011010010111101001101110100110000001100101000101101101011110 e8bebfe5a49ae697a6e98194e8bebfe5a49ae697a6e981945be8bebfe5a49ae697a6e98194e8bebfe5a49ae697a6e981945b5e
UHC ?多旦達?多旦達[?多旦達?多旦達[^ 00111111110100101111110111010011101010011101001110111001001111111101001011111101110100111010100111010011101110010101101100111111110100101111110111010011101010011101001110111001001111111101001011111101110100111010100111010011101110010101101101011110 3fd2fdd3a9d3b93fd2fdd3a9d3b95b3fd2fdd3a9d3b93fd2fdd3a9d3b95b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)