To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??肉③?儒??嵬?????恂ロ?瑤?? 1001101001101010001111110011111110010011111101111000011101000010001111111000111011110010001111110011111110011011110010100011111100111111001111110011111100111111100111001001011010000011100011010011111111101010101000100011111100111111 9a6a3f3f93f787423f8ef23f3f9bca3f3f3f3f3f9c96838d3feaa23f3f
EUC-JP 嗚??肉??儒??嵬??馹?ł恂ロ?瑤?? 1101001111001011001111110011111111000110111110010011111100111111101111001111010000111111001111111101011011001100001111110011111110001111111010011010000100111111100011111010100111001000110101111111011010100101111011010011111111110100101001000011111100111111 d3cb3f3fc6f93f3fbcf43f3fd6cc3f3f8fe9a13f8fa9c8d7f6a5ed3ff4a43f3f
UTF-8 嗚삠굥肉③윀儒산눼嵬됰챶馹섊ł恂ロ닂瑤뗫궑 1110010110010111100110101110110010000010101000001110101010110101101001011110100010000010100010011110001010010001101000101110110010011100100000001110010110000100100100101110110010000010101100001110101110001000101111001110010110110101101011001110101110010000101100001110110010110001101101101110100110100110101110011110110010000100100010101100010110000010111001101000000110000010111000111000001110101101111010111000101110000010111001111001000110100100111010111001011110101011111010101011011010010001 e5979aec82a0eab5a5e88289e291a2ec9c80e58492ec82b0eb88bce5b5aceb90b0ecb1b6e9a6b9ec848ac582e68182e383adeb8b82e791a4eb97abeab691
UHC 嗚삠굥肉③윀儒산눼嵬됰챶馹섊ł恂ロ닂瑤뗫궑 111001111111000010111011111000111000001010001011111010111011111110101000111010011001111110001011111010101110001110111011111010101011010010110100111010001110001110001001111010111010101010000011111011001111000110011000111001111010100110101001111000101110000110101011111011011000100010001011111010001111110110001011111010111000001010100110 e7f0bbe3828bebbfa8e99f8beae3bbeab4b4e8e389ebaa83ecf198e7a9a9e2e1abed888be8fd8beb82a6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)