What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 汚??蜈??遙ラ?汚??蜈??遙ョ?^ 100010011001100000111111001111111110010110000101001111110011111111101010101000011000001110001001001111111000100110011000001111110011111111100101100001010011111100111111111010101010000110000011100001110011111101011110 89983f3fe5853f3feaa183893f89983f3fe5853f3feaa183873f5e
EUC-JP 汚??蜈??遙ラ?汚??蜈??遙ョ?^ 101100011111100000111111001111111110100111100101001111110011111111110100101000111010010111101001001111111011000111111000001111110011111111101001111001010011111100111111111101001010001110100101111001110011111101011110 b1f83f3fe9e53f3ff4a3a5e93fb1f83f3fe9e53f3ff4a3a5e73f5e
UTF-8 汚놅스蜈랃쉐遙ラ뼡汚놅스蜈랃쉐遙ョ졒^ 11100110101100011001101011101011100001101000010111101100100010101010010011101000100111001000100011101011100111101000001111101100100010011001000011101001100000011001100111100011100000111010100111101011101111001010000111100110101100011001101011101011100001101000010111101100100010101010010011101000100111001000100011101011100111101000001111101100100010011001000011101001100000011001100111100011100000111010011111101100101000011001001001011110 e6b19aeb8685ec8aa4e89c88eb9e83ec8990e98199e383a9ebbca1e6b19aeb8685ec8aa4e89c88eb9e83ec8990e98199e383a7eca1925e
UHC 汚놅스蜈랃쉐遙ラ뼡汚놅스蜈랃쉐遙ョ졒^ 11100111111111011000011011101111101111011011101011101000101001011000110111101111101111011010011011101001101010111010101111101001100101101010010011100111111111011000011011101111101111011011101011101000101001011000110111101111101111011010011011101001101010111010101111100111101000001011111101011110 e7fd86efbdbae8a58defbda6e9ababe996a4e7fd86efbdbae8a58defbda6e9ababe7a0bf5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
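
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_in_charsets and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions, and Python's codec tables may differ from the page's in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_in_charsets("あ^").items():
        print(label, bits, hexstr)
```

An ASCII character such as "^" comes out as the single byte 5e in every charset listed here, which is why each row of the table ends in 5e.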