To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???揄??怨??B 001111110011111100111111100111011000100100111111001111111000100110000101001111110011111101000010 3f3f3f9d893f3f89853f3f42
EUC-JP ???揄??怨??B 001111110011111100111111110110011110100100111111001111111011000111100101001111110011111101000010 3f3f3fd9e93f3fb1e53f3f42
UTF-8 蓼욧낍揄깁㎰怨㏓벑B 11101111101001111000001011101100100110101010011111101011100000101000110111100110100011111000010011101010101110011000000111100011100011101011000011100110100000001010100011100011100011111001001111101011101100101001000101000010 efa782ec9aa7eb828de68f84eab981e38eb0e680a8e38f93ebb29142
UHC 蓼욧낍揄깁㎰怨㏓벑B 11101001101001111011111111101010101100111010011111101010111100011011000111101001101001111011111111101010101100111010011111101011100100111011000101000010 e9a7bfeab3a7eaf1b1e9a7bfeab3a7eb93b142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)