To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚?????柔??巍リ???????甕?? 1001101001101010001111110011111100111111001111110011111110001111010111110011111100111111100110111101100110000011100010100011111100111111001111110011111100111111001111110011111111100001010100000011111100111111 9a6a3f3f3f3f3f8f5f3f3f9bd9838a3f3f3f3f3f3f3fe1503f3f
EUC-JP 嗚?????柔??巍リ????洧??甕?? 11010011110010110011111100111111001111110011111100111111101111011100000000111111001111111101011011011011101001011110101000111111001111110011111100111111100011111100011110110100001111110011111111100001101100010011111100111111 d3cb3f3f3f3f3fbdc03f3fd6dba5ea3f3f3f3f8fc7b43f3fe1b13f3f
UTF-8 嗚삳챶履됪쾮柔겹럥巍リ램類띺쐯洧고맓甕곗걖 111001011001011110011010111011001000001010110011111011001011000110110110111011111010011110011111111010111001000010101010111011001011111010101110111001101001111110010100111010101011001010111001111010111001111110100101111001011011011110001101111000111000001110101010111010111001111010101000111011111010011110010000111010111001110110111010111011001001000010101111111001101011010010100111111010101011001110100000111010111010011110010011111001111001010010010101111010101011001110010111111010101011000110010110 e5979aec82b3ecb1b6efa79feb90aaecbeaee69f94eab2b9eb9fa5e5b78de383aaeb9ea8efa790eb9dbaec90afe6b4a7eab3a0eba793e79495eab397eab196
UHC 嗚삳챶履됪쾮柔겹럥巍リ램類띺쐯洧고맓甕곗걖 111001111111000010111011111010111010101010000011111011001010101010001001111001101011001010000101111010101111010110110000111000111000111010001000111010001110010010101011111010101011011110100101111010111011101010001101111010011001110010010011111010101111101110110000111011011001000010100101111010001011100010110000111011001000000110000001 e7f0bbebaa83ecaa89e6b285eaf5b0e38e88e8e4abeab7a5ebba8de99c93eafbb0ed90a5e8b8b0ec8181

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)