To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???揄??怨??B 001111110011111100111111100111011000100100111111001111111000100110000101001111110011111101000010 3f3f3f9d893f3f89853f3f42
EUC-JP ???揄??怨??B 001111110011111100111111110110011110100100111111001111111011000111100101001111110011111101000010 3f3f3fd9e93f3fb1e53f3f42
UTF-8 樂뺢낍揄깁썢怨㏓벑B 11101111101001101011111111101011101110101010001011101011100000101000110111100110100011111000010011101010101110011000000111101100100011011010001011100110100000001010100011100011100011111001001111101011101100101001000101000010 efa6bfebbaa2eb828de68f84eab981ec8da2e680a8e38f93ebb29142
UHC 樂뺢낍揄깁썢怨㏓벑B 11101000111110011001010111101010101100111010011111101010111100011011000111101001100110111001010111101010101100111010011111101011100100111011000101000010 e8f995eab3a7eaf1b1e99b95eab3a7eb93b142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)