To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 處?蔚?????處?蔚?????^ 100110010111110000111111100010010101010100111111001111110011111100111111001111111001100101111100001111111000100101010101001111110011111100111111001111110011111101011110 997c3f89553f3f3f3f3f997c3f89553f3f3f3f3f5e
EUC-JP 處?蔚?????處?蔚?????^ 110100011101110100111111101100011011011000111111001111110011111100111111001111111101000111011101001111111011000110110110001111110011111100111111001111110011111101011110 d1dd3fb1b63f3f3f3f3fd1dd3fb1b63f3f3f3f3f5e
UTF-8 處렧蔚吳렮렢롒롚處렧蔚吳렮렢롒롘^ 11101000100110011001010111101011101000001010011111101000100101001001101011100101100100001011001111101011101000001010111011101011101000001010001011101011101000011001001011101011101000011001101011101000100110011001010111101011101000001010011111101000100101001001101011100101100100001011001111101011101000001010111011101011101000001010001011101011101000011001001011101011101000011001100001011110 e89995eba0a7e8949ae590b3eba0aeeba0a2eba192eba19ae89995eba0a7e8949ae590b3eba0aeeba0a2eba192eba1985e
UHC 處렧蔚吳렮렢롒롚處렧蔚吳렮렢롒롘^ 111101001010010110001110101101101110101010100101111001111110111110001110101110111000111010110011100011101101011110001110110111101111010010100101100011101011011011101010101001011110011111101111100011101011101110001110101100111000111011010111100011101101110001011110 f4a58eb6eaa5e7ef8ebb8eb38ed78edef4a58eb6eaa5e7ef8ebb8eb38ed78edc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)