To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ???誼??醫??v???誼??醫??vB 00111111001111110011111110001011011000100011111100111111111001111100111000111111001111110111011000111111001111110011111110001011011000100011111100111111111001111100111000111111001111110111011001000010 3f3f3f8b623f3fe7ce3f3f763f3f3f8b623f3fe7ce3f3f7642
EUC-JP 濚?Ŧ誼??醫??v濚?Ŧ誼??醫??vB 100011111100100110100001001111111000111110101001101011111011010111000011001111110011111111101110110100000011111100111111011101101000111111001001101000010011111110001111101010011010111110110101110000110011111100111111111011101101000000111111001111110111011001000010 8fc9a13f8fa9afb5c33f3feed03f3f768fc9a13f8fa9afb5c33f3feed03f3f7642
UTF-8 濚밸Ŧ誼양춯醫롪쾻v濚밸Ŧ誼양춯醫롪쾻vB 11100110101111111001101011101011101100001011100011000101101001101110100010101010101111001110110010010110100100011110110010110110101011111110100110000110101010111110101110100001101010101110110010111110101110110111011011100110101111111001101011101011101100001011100011000101101001101110100010101010101111001110110010010110100100011110110010110110101011111110100110000110101010111110101110100001101010101110110010111110101110110111011001000010 e6bf9aebb0b8c5a6e8aabcec9691ecb6afe986abeba1aaecbebb76e6bf9aebb0b8c5a6e8aabcec9691ecb6afe986abeba1aaecbebb7642
UHC 濚밸Ŧ誼양춯醫롪쾻v濚밸Ŧ誼양춯醫롪쾻vB 111001111011100110111001111010111010100010101110111010111111111010111110111001111010110110001100111011001010001010001110111010101011001010010001011101101110011110111001101110011110101110101000101011101110101111111110101111101110011110101101100011001110110010100010100011101110101010110010100100010111011001000010 e7b9b9eba8aeebfebee7ad8ceca28eeab29176e7b9b9eba8aeebfebee7ad8ceca28eeab2917642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
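
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `encode_table` are made up for illustration, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the table, but individual bytes may still differ from the page's output because codec implementations vary.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, input, binary, hexadecimal.
    sample = "あB"
    for label, (bits, hexstr) in encode_table(sample).items():
        print(label, sample, bits, hexstr)
```

Passing `errors="replace"` keeps the sketch from raising an exception on unencodable input, so every charset produces a row, as in the table.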