To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 枝??芸?諸戡??枝竭?逢??億??? 100011100111110100111111001111111000110001111100001111111000111110010100100111010100000100111111001111111000111001111101111000101001000100111111100010001010011100111111001111111000100110101101001111110011111100111111 8e7d3f3f8c7c3f8f949d413f3f8e7de2913f88a73f3f89ad3f3f3f
EUC-JP 枝??芸?諸戡??枝竭?逢??億??? 101110111101111000111111001111111011011111011101001111111011110111110100110110011010001000111111001111111011101111011110111000111111000100111111101100001010100100111111001111111011001010101111001111110011111100111111 bbde3f3fb7dd3fbdf4d9a23f3fbbdee3f13fb0a93f3fb2af3f3f3f
UTF-8 枝뷰강芸렔諸戡렰렕枝竭렪逢렰렎億골렰렢 111001101001111010011101111010111011011110110000111010101011000010010101111010001000101010111000111010111010000010010100111010001010101110111000111001101000100010100001111010111010000010110000111010111010000010010101111001101001111010011101111001111010101110101101111010111010000010101010111010011000000010100010111010111010000010110000111010111010000010001110111001011000010010000100111010101011001110101000111010111010000010110000111010111010000010100010 e69e9debb7b0eab095e88ab8eba094e8abb8e688a1eba0b0eba095e69e9de7abadeba0aae980a2eba0b0eba08ee58484eab3a8eba0b0eba0a2
UHC 枝뷰강芸렔諸戡렰렕枝竭렪逢렰렎億골렰렢 1111001010101011101110101110010010110000101011011110100111111101100011101010100111110000101100111100101011110001100011101011110110001110101010101111001010101011110010101110011010001110101110001101110011110001100011101011110110001110101001001110010111100010101100001111000110001110101111011000111010110011 f2abbae4b0ade9fd8ea9f0b3caf18ebd8eaaf2abcae68eb8dcf18ebd8ea4e5e2b0f18ebd8eb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)