To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN ??族簇??族簇B 00111111001111111001000110110000111000101100011000111111001111111001000110110000111000101100011001000010 3f3f91b0e2c63f3f91b0e2c642
EUC-JP ??族簇??族簇B 00111111001111111100001010110010111001001100100000111111001111111100001010110010111001001100100001000010 3f3fc2b2e4c83f3fc2b2e4c842
UTF-8 솨섶族簇솨섶族簇B 11101100100001101010100011101100100001001011011011100110100101111000111111100111101100001000011111101100100001101010100011101100100001001011011011100110100101111000111111100111101100001000011101000010 ec86a8ec84b6e6978fe7b087ec86a8ec84b6e6978fe7b08742
UHC 솨섶族簇솨섶族簇B 1011110011011101101111001011101111110000111010011111000011101010101111001101110110111100101110111111000011101001111100001110101001000010 bcddbcbbf0e9f0eabcddbcbbf0e9f0ea42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)