To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????C???????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f
SJIS-WIN ??????而????????C???????? 00111111001111110011111100111111001111110011111110001110101001110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f8ea73f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f
EUC-JP ??????而????????C???????? 00111111001111110011111100111111001111110011111110111100101010010011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3fbca93f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f
UTF-8 溜븐뵽溜뺣졎而듬젡溜븍젚溜삳젉C溜븐뵽溜뽯졋溜쥷 11101111101001111000101111101011101110001001000011101011101101011011110111101111101001111000101111101011101110101010001111101100101000011000111011101000100000001000110011101011100100111010110011101100101000001010000111101111101001111000101111101011101110001000110111101100101000001001101011101111101001111000101111101100100000101011001111101100101000001000100101000011111011111010011110001011111010111011100010010000111010111011010110111101111011111010011110001011111010111011110110101111111011001010000110001011111011111010011110001011111011001010010110110111 efa78bebb890ebb5bdefa78bebbaa3eca18ee8808ceb93aceca0a1efa78bebb88deca09aefa78bec82b3eca08943efa78bebb890ebb5bdefa78bebbdafeca18befa78beca5b7
UHC 溜븐뵽溜뺣졎而듬젡溜븍젚溜삳젉C溜븐뵽溜뽯졋溜쥷 1110101011111110101110101110110010010100101110111110101011111110100101011110101110100000101110111110110010111011101101011110101110100000100110101110101011111110101110101110101110100000100101101110101011111110101110111110101110100000100010110100001111101010111111101011101011101100100101001011101111101010111111101001011011101011101000001011101011101010111111101010001101000110 eafebaec94bbeafe95eba0bbecbbb5eba09aeafebaeba096eafebbeba08b43eafebaec94bbeafe96eba0baeafea346

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)