To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 淼樒濫韆仙シ丞ョ 1111101101000101100111101110011110010111100101001110100011100110100100001110010110111100100011111110010110101110 fb459ee79794e8e690e5bc8fe5ae
EUC-JP 淼樒濫韆仙シ丞ョ 1000111111000111111001101101110011101001110011011111010011110000111010001100000011100111100011101011110010111110111001111000111010101110 8fc7e6dce9cdf4f0e8c0e78ebcbee78eae
UTF-8 淼樒濫韆仙シ丞ョ 111001101011011110111100111001101010100010010010111001101011111110101011111010011001111110000110111001001011101110011001111011111011110110111100111001001011100010011110111011111011110110101110 e6b7bce6a892e6bfabe99f86e4bb99efbdbce4b89eefbdae
UHC ??濫韆仙?丞? 001111110011111111010101101111011111010011000111111000001011100100111111111000111010101000111111 3f3fd5bdf4c7e0b93fe3aa3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)