What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 仲???宜?才? | 1001001010000111001111110011111100111111100010110101100000111111100011011100101100111111 | 92873f3f3f8b583f8dcb3f
EUC-JP | 仲???宜?才? | 1100001111100111001111110011111100111111101101011011100100111111101110101100110100111111 | c3e73f3f3fb5b93fbacd3f
UTF-8 | 仲렩渽렜宜렧才렑 | 111001001011101110110010111010111010000010101001111001101011100010111101111010111010000010011100111001011010111010011100111010111010000010100111111001101000100110001101111010111010000010010001 | e4bbb2eba0a9e6b8bdeba09ce5ae9ceba0a7e6898deba091
UHC | 仲렩渽렜宜렧才렑 | 11110001111010101000111010110111111011101010101010001110101011101110101111110001100011101011011011101110101001101000111010100110 | f1ea8eb7eeaa8eaeebf18eb6eea68ea6

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
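
A conversion of this kind can be approximated in a few lines of Python. This is only a sketch and not the code behind this page: the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, the helper names encode_row and convert are made up for illustration, and Python's codec tables may differ from this page's for some characters. Characters that a charset cannot represent are replaced with "?", which is how the 3f bytes arise.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    # Decode again to show which characters survived the round trip.
    shown = raw.decode(codec)
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("仲").items():
        print(label, shown, bits, hexstr)
```

Calling convert with a longer string prints one row per charset in the same column order as the table above.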