To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 枳???鄭?儲戡??枳???鄭?儲戡??B 1001111001101011001111110011111100111111100100110100000100111111100101101101011110011101010000010011111100111111100111100110101100111111001111110011111110010011010000010011111110010110110101111001110101000001001111110011111101000010 9e6b3f3f3f93413f96d79d413f3f9e6b3f3f3f93413f96d79d413f3f42
EUC-JP 枳?雩?鄭?儲戡??枳?雩?鄭?儲戡??B 110110111100110000111111100011111110011011111010001111111100010110100010001111111100110011011001110110011010001000111111001111111101101111001100001111111000111111100110111110100011111111000101101000100011111111001100110110011101100110100010001111110011111101000010 dbcc3f8fe6fa3fc5a23fccd9d9a23f3fdbcc3f8fe6fa3fc5a23fccd9d9a23f3f42
UTF-8 枳렟雩렮鄭렩儲戡렰렑枳렟雩렮鄭렩儲戡렰렑B 11100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001100001001010110111101011101000001010100111100101100001001011001011100110100010001010000111101011101000001011000011101011101000001001000111100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001100001001010110111101011101000001010100111100101100001001011001011100110100010001010000111101011101000001011000011101011101000001001000101000010 e69eb3eba09fe99ba9eba0aee984adeba0a9e584b2e688a1eba0b0eba091e69eb3eba09fe99ba9eba0aee984adeba0a9e584b2e688a1eba0b0eba09142
UHC 枳렟雩렮鄭렩儲戡렰렑枳렟雩렮鄭렩儲戡렰렑B 1111001010101100100011101011000011101001111011001000111010111011111011111111011110001110101101111110111010111001110010101111000110001110101111011000111010100110111100101010110010001110101100001110100111101100100011101011101111101111111101111000111010110111111011101011100111001010111100011000111010111101100011101010011001000010 f2ac8eb0e9ec8ebbeff78eb7eeb9caf18ebd8ea6f2ac8eb0e9ec8ebbeff78eb7eeb9caf18ebd8ea642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)