To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????±? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f
SJIS-WIN 癲?8意??儒??癰?????猷??蘂?±猷 11100001100111110011111110000010010101111000100011010011001111110011111110001110111100100011111100111111111000011001111000111111001111110011111100111111001111111001011101010001001111110011111111100101010000010011111110000001011111011001011101010001 e19f3f825788d33f3f8ef23f3fe19e3f3f3f3f3f97513f3fe5413f817d9751
EUC-JP 癲?8意??儒??癰?????猷??蘂?±猷 11100010101000010011111110100011101110001011000011010101001111110011111110111100111101000011111100111111111000011111111000111111001111110011111100111111001111111100110110110010001111110011111111101001101000100011111110100001110111101100110110110010 e2a13fa3b8b0d53f3fbcf43f3fe1fe3f3f3f3f3fcdb23f3fe9a23fa1decdb2
UTF-8 癲쒕8意띶젆儒붽콟癰귥룆柳볩쬅猷몄뵂蘂뚯±猷 1110011110011001101100101110110010010010100101011110111110111100100110001110011010000100100011111110101110011101101101101110110010100000100001101110010110000100100100101110101110110110101111011110110010111101100111111110011110011001101100001110101010110111101001011110101110100011100001101110111110100111100010011110101110110011101010011110110010101100100001011110011110001100101101111110101110101010100001001110101110110101100000101110100010011000100000101110101110011010101011111100001010110001111001111000110010110111 e799b2ec9295efbc98e6848feb9db6eca086e58492ebb6bdecbd9fe799b0eab7a5eba386efa789ebb3a9ecac85e78cb7ebaa84ebb582e89882eb9aafc2b1e78cb7
UHC 癲쒕8意띶젆儒붽콟癰귥룆柳볩쬅猷몄뵂蘂뚯±猷 1110111110100110100111001110101110100011101110001110101111110010100011011110010110100000100010011110101011100011100101001110101010110001100101111110100010111001100000101110110010001111100001011110101011110111100100111110111110100110100111001110101110100011101110001110110010010100100010001110011111011110100011001110110010100001101111101110101110100011 efa69ceba3b8ebf28de5a089eae394eab197e8b982ec8f85eaf793efa69ceba3b8ec9488e7de8ceca1beeba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)