To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 暗??有??恂モ?}暗??有??恂モ?{^ 1000100011000011001111110011111110010111010011000011111100111111100111001001011010000011100000100011111101111101100010001100001100111111001111111001011101001100001111110011111110011100100101101000001110000010001111110111101101011110 88c33f3f974c3f3f9c9683823f7d88c33f3f974c3f3f9c9683823f7b5e
EUC-JP 暗??有??恂モ?}暗??有??恂モ?{^ 1011000011000101001111110011111111001101101011010011111100111111110101111111011010100101111000100011111101111101101100001100010100111111001111111100110110101101001111110011111111010111111101101010010111100010001111110111101101011110 b0c53f3fcdad3f3fd7f6a5e23f7db0c53f3fcdad3f3fd7f6a5e23f7b5e
UTF-8 暗산램有멩윍恂モ봼}暗산램有멩윍恂モ봼{^ 111001101001101010010111111011001000001010110000111010111001111010101000111001101001110010001001111010111010100110101001111011001001110010001101111001101000000110000010111000111000001110100010111010111011010010111100011111011110011010011010100101111110110010000010101100001110101110011110101010001110011010011100100010011110101110101001101010011110110010011100100011011110011010000001100000101110001110000011101000101110101110110100101111000111101101011110 e69a97ec82b0eb9ea8e69c89eba9a9ec9c8de68182e383a2ebb4bc7de69a97ec82b0eb9ea8e69c89eba9a9ec9c8de68182e383a2ebb4bc7b5e
UHC 暗산램有멩윍恂モ봼}暗산램有멩윍恂モ봼{^ 111001001101111010111011111010101011011110100101111010101111001110111000111001101001111110010100111000101110000110101011111000101001010010000011011111011110010011011110101110111110101010110111101001011110101011110011101110001110011010011111100101001110001011100001101010111110001010010100100000110111101101011110 e4debbeab7a5eaf3b8e69f94e2e1abe294837de4debbeab7a5eaf3b8e69f94e2e1abe294837b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)