To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 曜わ?梧??蜈??曜わ?梧??蜈??^ 100101110110101010000010111011010011111110001100111001100011111100111111111001011000010100111111001111111001011101101010100000101110110100111111100011001110011000111111001111111110010110000101001111110011111101011110 976a82ed3f8ce63f3fe5853f3f976a82ed3f8ce63f3fe5853f3f5e
EUC-JP 曜わ?梧??蜈??曜わ?梧??蜈??^ 110011011100101110100100111011110011111110111000111010000011111100111111111010011110010100111111001111111100110111001011101001001110111100111111101110001110100000111111001111111110100111100101001111110011111101011110 cdcba4ef3fb8e83f3fe9e53f3fcdcba4ef3fb8e83f3fe9e53f3f5e
UTF-8 曜わ슘梧잞쉭蜈욅뼀曜わ슘梧잞쉭蜈욅뼀^ 11100110100110111001110011100011100000101000111111101100100010101001100011100110101000101010011111101100100111101001111011101100100010011010110111101000100111001000100011101100100110101000010111101011101111001000000011100110100110111001110011100011100000101000111111101100100010101001100011100110101000101010011111101100100111101001111011101100100010011010110111101000100111001000100011101100100110101000010111101011101111001000000001011110 e69b9ce3828fec8a98e6a2a7ec9e9eec89ade89c88ec9a85ebbc80e69b9ce3828fec8a98e6a2a7ec9e9eec89ade89c88ec9a85ebbc805e
UHC 曜わ슘梧잞쉭蜈욅뼀曜わ슘梧잞쉭蜈욅뼀^ 11101000111110001010101011101111101111011011011111100111111111001001111111101111101111011010110111101000101001011001111011100111100101101000101111101000111110001010101011101111101111011011011111100111111111001001111111101111101111011010110111101000101001011001111011100111100101101000101101011110 e8f8aaefbdb7e7fc9fefbdade8a59ee7968be8f8aaefbdb7e7fc9fefbdade8a59ee7968b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
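
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_table` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Replacing unencodable characters with "?" (byte 3f) is also an assumption, suggested by the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters
        # that the target charset cannot represent.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("曜^"):
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.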