To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 堤???琮磐??n}堤???琮磐??n{^ 100100101110011100111111001111110011111111111011011010101001010011010110001111110011111101101110011111011001001011100111001111110011111100111111111110110110101010010100110101100011111100111111011011100111101101011110 92e73f3f3ffb6a94d63f3f6e7d92e73f3f3ffb6a94d63f3f6e7b5e
EUC-JP 堤???琮磐??n}堤???琮磐??n{^ 1100010011101001001111110011111100111111100011111100110010110010110010001101100000111111001111110110111001111101110001001110100100111111001111110011111110001111110011001011001011001000110110000011111100111111011011100111101101011110 c4e93f3f3f8fccb2c8d83f3f6e7dc4e93f3f3f8fccb2c8d83f3f6e7b5e
UTF-8 堤비렰렧琮磐렰렫n}堤비렰렧琮磐렰렫n{^ 1110010110100000101001001110101110111001100001001110101110100000101100001110101110100000101001111110011110010000101011101110011110100011100100001110101110100000101100001110101110100000101010110110111001111101111001011010000010100100111010111011100110000100111010111010000010110000111010111010000010100111111001111001000010101110111001111010001110010000111010111010000010110000111010111010000010101011011011100111101101011110 e5a0a4ebb984eba0b0eba0a7e790aee7a390eba0b0eba0ab6e7de5a0a4ebb984eba0b0eba0a7e790aee7a390eba0b0eba0ab6e7b5e
UHC 堤비렰렧琮磐렰렫n}堤비렰렧琮磐렰렫n{^ 11110000101001111011101011110001100011101011110110001110101101101111000011111001110110101111000110001110101111011000111010111001011011100111110111110000101001111011101011110001100011101011110110001110101101101111000011111001110110101111000110001110101111011000111010111001011011100111101101011110 f0a7baf18ebd8eb6f0f9daf18ebd8eb96e7df0a7baf18ebd8eb6f0f9daf18ebd8eb96e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)