To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畏??孺??怨??搖??誼???l?艾 1000100011011000001111110011111110011011011111010011111100111111100010011000010100111111001111111001110110001010001111110011111110001011011000100011111100111111001111111000001010001100001111111110010010001000 88d83f3f9b7d3f3f89853f3f9d8a3f3f8b623f3f3f828c3fe488
EUC-JP 畏??孺??怨??搖??誼??洹l?艾 10110000110110100011111100111111110101011101111000111111001111111011000111100101001111110011111111011001111010100011111100111111101101011100001100111111001111111000111111000111101110101010001111101100001111111110011111101000 b0da3f3fd5de3f3fb1e53f3fd9ea3f3fb5c33f3f8fc7baa3ec3fe7e8
UTF-8 畏븍맩孺삼쭓怨뺤졋搖삳돃誼꿰벀洹l졄艾 111001111001010110001111111010111011100010001101111010111010011110101001111001011010110110111010111011001000001010111100111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110001011111001101001000010010110111011001000001010110011111010111000111110000011111010001010101010111100111010101011111110110000111010111011001010000000111001101011010010111001111011111011110110001100111011001010000110000100111010001000100110111110 e7958febb88deba7a9e5adbaec82bcecad93e680a8ebbaa4eca18be69096ec82b3eb8f83e8aabceabfb0ebb280e6b4b9efbd8ceca184e889be
UHC 畏븍맩孺삼쭓怨뺤졋搖삳돃誼꿰벀洹l졄艾 1110100011100110101110101110101110010000101100011110101011101000101110111110111110100111100010111110101010110011100101011110110010100000101110101110100011110100101110111110101110001001100101101110101111111110101100101110011110010011101001101110101010110111101000111110110010100000101101011110010011110101 e8e6baeb90b1eae8bbefa78beab395eca0bae8f4bbeb8996ebfeb2e793a6eab7a3eca0b5e4f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)