Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 遙ワ?鈺??馭?n}遙ワ?鈺??馭?n{^ 1110101010100001100000111000111100111111111110111100010000111111001111111110100101100110001111110110111001111101111010101010000110000011100011110011111111111011110001000011111100111111111010010110011000111111011011100111101101011110 eaa1838f3ffbc43f3fe9663f6e7deaa1838f3ffbc43f3fe9663f6e7b5e
EUC-JP 遙ワ?鈺??馭?n}遙ワ?鈺??馭?n{^ 11110100101000111010010111101111001111111000111111100011110101010011111100111111111100011100011100111111011011100111110111110100101000111010010111101111001111111000111111100011110101010011111100111111111100011100011100111111011011100111101101011110 f4a3a5ef3f8fe3d53f3ff1c73f6e7df4a3a5ef3f8fe3d53f3ff1c73f6e7b5e
UTF-8 遙ワ쉼鈺롧뵽馭짥n}遙ワ쉼鈺롧뵽馭짥n{^ 1110100110000001100110011110001110000011101011111110110010001001101111001110100110001000101110101110101110100001101001111110101110110101101111011110100110100110101011011110110010100111101001010110111001111101111010011000000110011001111000111000001110101111111011001000100110111100111010011000100010111010111010111010000110100111111010111011010110111101111010011010011010101101111011001010011110100101011011100111101101011110 e98199e383afec89bce988baeba1a7ebb5bde9a6adeca7a56e7de98199e383afec89bce988baeba1a7ebb5bde9a6adeca7a56e7b5e
UHC 遙ワ쉼鈺롧뵽馭짥n}遙ワ쉼鈺롧뵽馭짥n{^ 11101001101010111010101111101111101111011011000011101000101011011000111011100111100101001011101111100101110111111010010001000101011011100111110111101001101010111010101111101111101111011011000011101000101011011000111011100111100101001011101111100101110111111010010001000101011011100111101101011110 e9ababefbdb0e8ad8ee794bbe5dfa4456e7de9ababefbdb0e8ad8ee794bbe5dfa4456e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
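
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch, not the code behind this page: the function name `encode_all` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become b"?" (0x3f) instead of raising.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ").items():
        print(label, bits, hex_string)
```

For an ASCII-only input such as `A`, every charset in the list yields the same single byte (`41`), because all five are ASCII-compatible for that range; differences only appear for characters outside ASCII.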