To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????而??????????????? 0011111100111111001111110011111100111111001111111000111010100111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f8ea73f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??????而??????????????? 0011111100111111001111110011111100111111001111111011110010101001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3fbca93f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 溜삳젗溜브콦而듬젚溜븍젍溜뷴뵜溜볥졋溜딅졋溜 111011111010011110001011111011001000001010110011111011001010000010010111111011111010011110001011111010111011100010001100111011001011110110100110111010001000000010001100111010111001001110101100111011001010000010011010111011111010011110001011111010111011100010001101111011001010000010001101111011111010011110001011111010111011011110110100111010111011010110011100111011111010011110001011111010111011001110100101111011001010000110001011111011111010011110001011111010111001010010000101111011001010000110001011111011111010011110001011 efa78bec82b3eca097efa78bebb88cecbda6e8808ceb93aceca09aefa78bebb88deca08defa78bebb7b4ebb59cefa78bebb3a5eca18befa78beb9485eca18befa78b
UHC 溜삳젗溜브콦而듬젚溜븍젍溜뷴뵜溜볥졋溜딅졋溜 1110101011111110101110111110101110100000100100111110101011111110101110101110101010110001100111001110110010111011101101011110101110100000100101101110101011111110101110101110101110100000100011101110101011111110101110101110010110010100100111001110101011111110100100111110101110100000101110101110101011111110100010101110101110100000101110101110101011111110 eafebbeba093eafebaeab19cecbbb5eba096eafebaeba08eeafebae5949ceafe93eba0baeafe8aeba0baeafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)