To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鉛??炎?????n}鉛??炎?????n{^ 100010011001010000111111001111111000100110001010001111110011111100111111001111110011111101101110011111011000100110010100001111110011111110001001100010100011111100111111001111110011111100111111011011100111101101011110 89943f3f898a3f3f3f3f3f6e7d89943f3f898a3f3f3f3f3f6e7b5e
EUC-JP 鉛??炎?????n}鉛??炎?????n{^ 101100011111010000111111001111111011000111101010001111110011111100111111001111110011111101101110011111011011000111110100001111110011111110110001111010100011111100111111001111110011111100111111011011100111101101011110 b1f43f3fb1ea3f3f3f3f3f6e7db1f43f3fb1ea3f3f3f3f3f6e7b5e
UTF-8 鉛뗰슨炎덌풃連곁뙒n}鉛뗰슨炎덌풃連곁뙒n{^ 1110100110001001100110111110101110010111101100001110110010001010101010001110011110000010100011101110101110001101100011001110110110010010100000111110111110100110100110101110101010110011100000011110101110011001100100100110111001111101111010011000100110011011111010111001011110110000111011001000101010101000111001111000001010001110111010111000110110001100111011011001001010000011111011111010011010011010111010101011001110000001111010111001100110010010011011100111101101011110 e9899beb97b0ec8aa8e7828eeb8d8ced9283efa69aeab381eb99926e7de9899beb97b0ec8aa8e7828eeb8d8ced9283efa69aeab381eb99926e7b5e
UHC 鉛뗰슨炎덌풃連곁뙒n}鉛뗰슨炎덌풃連곁뙒n{^ 1110011011100111100010111110111110111101101111001110011011111010100010001110111110111110100010111110011011100110101100001110011110001100100101110110111001111101111001101110011110001011111011111011110110111100111001101111101010001000111011111011111010001011111001101110011010110000111001111000110010010111011011100111101101011110 e6e78befbdbce6fa88efbe8be6e6b0e78c976e7de6e78befbdbce6fa88efbe8be6e6b0e78c976e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)