To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 庸??夷??邑れ? 10010111011001100011111100111111100010001100111000111111001111111001011101010111100000101110101000111111 97663f3f88ce3f3f975782ea3f
EUC-JP 庸??夷??邑れ? 11001101110001110011111100111111101100001101000000111111001111111100110110111000101001001110110000111111 cdc73f3fb0d03f3fcdb8a4ec3f
UTF-8 庸뉖짘夷㎩선邑れ뿷 111001011011101010111000111010111000100110010110111011001010011110011000111001011010010010110111111000111000111010101001111011001000010010100000111010011000001010010001111000111000001010001100111010111011111110110111 e5bab8eb8996eca798e5a4b7e38ea9ec84a0e98291e3828cebbfb7
UHC 庸뉖짘夷㎩선邑れ뿷 111010011011110010000111111010111010001110011111111011001010100010100111111001011011110010110001111010111110100110101010111011001001011110110111 e9bc87eba39feca8a7e5bcb1ebe9aaec97b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)