To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 仲籠?堤耕????仲籠?堤耕????^ 100100101000011111100010110001000011111110010010111001111000110101101011001111110011111100111111001111111001001010000111111000101100010000111111100100101110011110001101011010110011111100111111001111110011111101011110 9287e2c43f92e78d6b3f3f3f3f9287e2c43f92e78d6b3f3f3f3f5e
EUC-JP 仲籠?堤耕??釪?仲籠?堤耕??釪?^ 11000011111001111110010011000110001111111100010011101001101110011100110000111111001111111000111111100011101011010011111111000011111001111110010011000110001111111100010011101001101110011100110000111111001111111000111111100011101011010011111101011110 c3e7e4c63fc4e9b9cc3f3f8fe3ad3fc3e7e4c63fc4e9b9cc3f3f8fe3ad3f5e
UTF-8 仲籠♠堤耕렠렗釪받仲籠♠堤耕렠렗釪밗^ 11100100101110111011001011100111101100011010000011100010100110011010000011100101101000001010010011101000100000001001010111101011101000001010000011101011101000001001011111101001100001111010101011101011101100001001101111100100101110111011001011100111101100011010000011100010100110011010000011100101101000001010010011101000100000001001010111101011101000001010000011101011101000001001011111101001100001111010101011101011101100001001011101011110 e4bbb2e7b1a0e299a0e5a0a4e88095eba0a0eba097e987aaebb09be4bbb2e7b1a0e299a0e5a0a4e88095eba0a0eba097e987aaebb0975e
UHC 仲籠♠堤耕렠렗釪받仲籠♠堤耕렠렗釪밗^ 11110001111010101101011011101011101000101011110011110000101001111100110011101001100011101011000110001110101011001110100111101001101110011101111011110001111010101101011011101011101000101011110011110000101001111100110011101001100011101011000110001110101011001110100111101001101110011101110001011110 f1ead6eba2bcf0a7cce98eb18eace9e9b9def1ead6eba2bcf0a7cce98eb18eace9e9b9dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)