To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???鮟雰???鮟雰B 001111110011111100111111111010011011101110010101101101010011111100111111001111111110100110111011100101011011010101000010 3f3f3fe9bb95b53f3f3fe9bb95b542
EUC-JP ???鮟雰???鮟雰B 001111110011111100111111111100101011110111001010101101110011111100111111001111111111001010111101110010101011011101000010 3f3f3ff2bdcab73f3f3ff2bdcab742
UTF-8 뤱횓삼鮟雰뤱횓삼鮟雰B 11101011101001001011000111101101100110101001001111101100100000101011110011101001101011101001111111101001100110111011000011101011101001001011000111101101100110101001001111101100100000101011110011101001101011101001111111101001100110111011000001000010 eba4b1ed9a93ec82bce9ae9fe99bb0eba4b1ed9a93ec82bce9ae9fe99bb042
UHC 뤱횓삼鮟雰뤱횓삼鮟雰B 100011111101111111000011100011101011101111101111111001001101010111011101110101001000111111011111110000111000111010111011111011111110010011010101110111011101010001000010 8fdfc38ebbefe4d5ddd48fdfc38ebbefe4d5ddd442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)