To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 娃??曜?????傲??娃??曜?????傲??^ 10001000101000010011111100111111100101110110101000111111001111110011111100111111001111111001100011111100001111110011111110001000101000010011111100111111100101110110101000111111001111110011111100111111001111111001100011111100001111110011111101011110 88a13f3f976a3f3f3f3f3f98fc3f3f88a13f3f976a3f3f3f3f3f98fc3f3f5e
EUC-JP 娃??曜??旿??傲??娃??曜??旿??傲??^ 1011000010100011001111110011111111001101110010110011111100111111100011111100000111110100001111110011111111010000111111100011111100111111101100001010001100111111001111111100110111001011001111110011111110001111110000011111010000111111001111111101000011111110001111110011111101011110 b0a33f3fcdcb3f3f8fc1f43f3fd0fe3f3fb0a33f3fcdcb3f3f8fc1f43f3fd0fe3f3f5e
UTF-8 娃띰쉠曜뱄슘旿울쉘傲됧눀娃띰쉠曜뱄슘旿울쉘傲됪쑍^ 11100101101010001000001111101011100111011011000011101100100010011010000011100110100110111001110011101011101100011000010011101100100010101001100011100110100101111011111111101100100110101011100011101100100010011001100011100101100000101011001011101011100100001010011111101011100010001000000011100101101010001000001111101011100111011011000011101100100010011010000011100110100110111001110011101011101100011000010011101100100010101001100011100110100101111011111111101100100110101011100011101100100010011001100011100101100000101011001011101011100100001010101011101100100100011000110101011110 e5a883eb9db0ec89a0e69b9cebb184ec8a98e697bfec9ab8ec8998e582b2eb90a7eb8880e5a883eb9db0ec89a0e69b9cebb184ec8a98e697bfec9ab8ec8998e582b2eb90aaec918d5e
UHC 娃띰쉠曜뱄슘旿울쉘傲됧눀娃띰쉠曜뱄슘旿울쉘傲됪쑍^ 11101000110111111011011011101111101111011010101011101000111110001011100111101111101111011011011111100111111110101011111111101111101111011010100111100111111011001000100111100101100001111010000111101000110111111011011011101111101111011010101011101000111110001011100111101111101111011011011111100111111110101011111111101111101111011010100111100111111011001000100111100110100111001010110001011110 e8dfb6efbdaae8f8b9efbdb7e7fabfefbda9e7ec89e587a1e8dfb6efbdaae8f8b9efbdb7e7fabfefbda9e7ec89e69cac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)