To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}Y????????}bE 001111110011111100111111001111110011111100111111001111110011111101111101010110010011111100111111001111110011111100111111001111110011111100111111011111010110001001000101 3f3f3f3f3f3f3f3f7d593f3f3f3f3f3f3f3f7d6245
SJIS-WIN 鬘ッ蠕枅鬘ッ蠕杰}Y鬘ッ蠕枅鬘ッ蠕杰}bE 111010011010000110101111111001011011111010011110011001101110100110100001101011111110010110111110100111100101111001111101010110011110100110100001101011111110010110111110100111100110011011101001101000011010111111100101101111101001111001011110011111010110001001000101 e9a1afe5be9e66e9a1afe5be9e5e7d59e9a1afe5be9e66e9a1afe5be9e5e7d6245
EUC-JP 鬘ッ蠕枅鬘ッ蠕杰}Y鬘ッ蠕枅鬘ッ蠕杰}bE 11110010101000111000111010101111111010101100000011011011110001111111001010100011100011101010111111101010110000001101101110111111011111010101100111110010101000111000111010101111111010101100000011011011110001111111001010100011100011101010111111101010110000001101101110111111011111010110001001000101 f2a38eafeac0dbc7f2a38eafeac0dbbf7d59f2a38eafeac0dbc7f2a38eafeac0dbbf7d6245
UTF-8 鬘ッ蠕枅鬘ッ蠕杰}Y鬘ッ蠕枅鬘ッ蠕杰}bE 1110100110101100100110001110111110111101101011111110100010100000100101011110011010011110100001011110100110101100100110001110111110111101101011111110100010100000100101011110011010011101101100000111110101011001111010011010110010011000111011111011110110101111111010001010000010010101111001101001111010000101111010011010110010011000111011111011110110101111111010001010000010010101111001101001110110110000011111010110001001000101 e9ac98efbdafe8a095e69e85e9ac98efbdafe8a095e69db07d59e9ac98efbdafe8a095e69e85e9ac98efbdafe8a095e69db07d6245
UHC ???????杰}Y???????杰}bE 0011111100111111001111110011111100111111001111110011111111001011111110010111110101011001001111110011111100111111001111110011111100111111001111111100101111111001011111010110001001000101 3f3f3f3f3f3f3fcbf97d593f3f3f3f3f3f3fcbf97d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)