To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??惟れ???????猷??猥??? 1110000110011111001111111000001001010111111001101111101000111111001111111000100011010010100000101110101000111111001111110011111100111111001111110011111100111111100101110101000100111111001111111110000011001110001111110011111100111111 e19f3f8257e6fa3f3f88d282ea3f3f3f3f3f3f3f97513f3fe0ce3f3f3f
EUC-JP 癲?8踰??惟れ???????猷??猥??嫄 11100010101000010011111110100011101110001110110011111100001111110011111110110000110101001010010011101100001111110011111100111111001111110011111100111111001111111100110110110010001111110011111111100000110100000011111100111111100011111011101010100001 e2a13fa3b8ecfc3f3fb0d4a4ec3f3f3f3f3f3f3fcdb23f3fe0d03f3f8fbaa1
UTF-8 癲쒕8踰딂굢惟れ돵連곗슜柳놅쬅猷몄뵇猥됤넂嫄 111001111001100110110010111011001001001010010101111011111011110010011000111010001011100010110000111010111001010010000010111010101011010110100010111001101000001110011111111000111000001010001100111010111000111110110101111011111010011010011010111010101011001110010111111011001000101010011100111011111010011110001001111010111000011010000101111011001010110010000101111001111000110010110111111010111010101010000100111010111011010110000111111001111000110010100101111010111001000010100100111010111000010010000010111001011010101110000100 e799b2ec9295efbc98e8b8b0eb9482eab5a2e6839fe3828ceb8fb5efa69aeab397ec8a9cefa789eb8685ecac85e78cb7ebaa84ebb587e78ca5eb90a4eb8482e5ab84
UHC 癲쒕8踰딂굢惟れ돵連곗슜柳놅쬅猷몄뵇猥됤넂嫄 1110111110100110100111001110101110100011101110001110101110110010100010101110100010000010100010011110101011101110101010101110110010001001101110001110011011100110101100001110110010011010101010011110101011110111100001101110111110100110100111001110101110100011101110001110110010010100100011011110100011100101100010011110001010000110100100101110101010110001 efa69ceba3b8ebb28ae88289eaeeaaec89b8e6e6b0ec9aa9eaf786efa69ceba3b8ec948de8e589e28692eab1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)