To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 薔???虞??異?肢?薔???虞??異?梓? 111001010100101100111111001111110011111110001011111100010011111100111111100010001101100100111111100011101000100000111111111001010100101100111111001111110011111110001011111100010011111100111111100010001101100100111111100010001011001000111111 e54b3f3f3f8bf13f3f88d93f8e883fe54b3f3f3f8bf13f3f88d93f88b23f
EUC-JP 薔???虞?祛異?肢?薔???虞?祛異?梓? 11101001101011000011111100111111001111111011011011110011001111111000111111010000110101111011000011011011001111111011101111101000001111111110100110101100001111110011111100111111101101101111001100111111100011111101000011010111101100001101101100111111101100001011010000111111 e9ac3f3f3fb6f33f8fd0d7b0db3fbbe83fe9ac3f3f3fb6f33f8fd0d7b0db3fb0b43f
UTF-8 薔며렎렜虞곁祛異렔肢렓薔며렎렜虞곁祛異렔梓렒 111010001001011010010100111010111010100110110000111010111010000010001110111010111010000010011100111010001001100110011110111010101011001110000001111001111010010110011011111001111001010110110000111010111010000010010100111010001000001010100010111010111010000010010011111010001001011010010100111010111010100110110000111010111010000010001110111010111010000010011100111010001001100110011110111010101011001110000001111001111010010110011011111001111001010110110000111010111010000010010100111001101010001010010011111010111010000010010010 e89694eba9b0eba08eeba09ce8999eeab381e7a59be795b0eba094e882a2eba093e89694eba9b0eba08eeba09ce8999eeab381e7a59be795b0eba094e6a293eba092
UHC 薔며렎렜虞곁祛異렔肢렓薔며렎렜虞곁祛異렔梓렒 1110110111111001101110001110011110001110101001001000111010101110111010011110010110110000111001111100101111100100111011001011011010001110101010011111001010110110100011101010100011101101111110011011100011100111100011101010010010001110101011101110100111100101101100001110011111001011111001001110110010110110100011101010100111101110101010011000111010100111 edf9b8e78ea48eaee9e5b0e7cbe4ecb68ea9f2b68ea8edf9b8e78ea48eaee9e5b0e7cbe4ecb68ea9eea98ea7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)