To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??椅??袁р? 11101010010111110011111100111111100010001101011000111111001111111110010111001101100001001000001000111111 ea5f3f3f88d63f3fe5cd84823f
EUC-JP 鸚??椅??袁р? 11110011110000000011111100111111101100001101100000111111001111111110101011001111101001111110001000111111 f3c03f3fb0d83f3feacfa7e23f
UTF-8 鸚쒓퍒椅썲푻袁р뵒 1110100110111000100110101110110010010010100100111110110110001101100100101110011010100100100001011110110010001101101100101110110110010001101110111110100010100010100000011101000110000000111010111011010110010010 e9b89aec9293ed8d92e6a485ec8db2ed91bbe8a281d180ebb592
UHC 鸚쒓퍒椅썲푻袁р뵒 111001011010010010011100111010101011101110001001111010111111010110111101111001011011111010000111111010101011111010101100111000101001010010010100 e5a49ceabb89ebf5bde5be87eabeace29494

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)