Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?蒼??輯?蒼??輯B 001111111001000110010011001111110011111110001111010100110011111110010001100100110011111100111111100011110101001101000010 3f91933f3f8f533f91933f3f8f5342
EUC-JP ?蒼??輯?蒼??輯B 001111111100000111110011001111110011111110111101101101000011111111000001111100110011111100111111101111011011010001000010 3fc1f33f3fbdb43fc1f33f3fbdb442
UTF-8 렰蒼롚렦輯렰蒼롚렦輯B 11101011101000001011000011101000100100101011110011101011101000011001101011101011101000001010011011101000101111001010111111101011101000001011000011101000100100101011110011101011101000011001101011101011101000001010011011101000101111001010111101000010 eba0b0e892bceba19aeba0a6e8bcafeba0b0e892bceba19aeba0a6e8bcaf42
UHC 렰蒼롚렦輯렰蒼롚렦輯B 100011101011110111110011111011111000111011011110100011101011010111110010111111101000111010111101111100111110111110001110110111101000111010110101111100101111111001000010 8ebdf3ef8ede8eb5f2fe8ebdf3ef8ede8eb5f2fe42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
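
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels above to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and encode_row is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f) via errors="replace", which matches the 3f bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) because of errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u84bcB"  # U+84BC (one of the characters in the table) followed by "B"
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

For example, encode_row("B", "utf-8") yields the pair ("01000010", "42"), which corresponds to the trailing 42 in every row of the table.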