To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 節??蘊ょ?弱??節①?亦??亦??節?? 100100001101111100111111001111111110010101011101100000101110010100111111100011101110001100111111001111111001000011011111100001110100000000111111100101101001001000111111001111111001011010010010001111110011111110010000110111110011111100111111 90df3f3fe55d82e53f8ee33f3f90df87403f96923f3f96923f3f90df3f3f
EUC-JP 節??蘊ょ?弱??節??亦??亦??節?? 1100000011100001001111110011111111101001101111101010010011100111001111111011110011100101001111110011111111000000111000010011111100111111110010111111001000111111001111111100101111110010001111110011111111000000111000010011111100111111 c0e13f3fe9bea4e73fbce53f3fc0e13f3fcbf23f3fcbf23f3fc0e13f3f
UTF-8 節억쉬蘊ょ댚弱꾣㉤節①뿏亦삭춼亦삯뇠節억쉬 111001111010111110000000111011001001011010110101111011001000100110101100111010001001100010001010111000111000001010000111111010111000110010011010111001011011110010110001111010101011111010100011111000111000100110100100111001111010111110000000111000101001000110100000111010111011111110001111111001001011101010100110111011001000001010101101111011001011011010111100111001001011101010100110111011001000001010101111111010111000011110100000111001111010111110000000111011001001011010110101111011001000100110101100 e7af80ec96b5ec89ace8988ae38287eb8c9ae5bcb1eabea3e389a4e7af80e291a0ebbf8fe4baa6ec82adecb6bce4baa6ec82afeb87a0e7af80ec96b5ec89ac
UHC 節억쉬蘊ょ댚弱꾣㉤節①뿏亦삭춼亦삯뇠節억쉬 111011111011110110111110111011111011110110101100111010001011001110101010111001111000100010111110111001011011000010000100111001101010100010110101111011111011110110101000111001111001011110010100111001101011001010111011111010001010110110011000111001101011001010111011111010011000011110001000111011111011110110111110111011111011110110101100 efbdbeefbdace8b3aae788bee5b084e6a8b5efbda8e79794e6b2bbe8ad98e6b2bbe98788efbdbeefbdac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)