What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 俑??逸??醫??v俑??逸??醫??vB 100110001101101000111111001111111000100011101101001111110011111111100111110011100011111100111111011101101001100011011010001111110011111110001000111011010011111100111111111001111100111000111111001111110111011001000010 98da3f3f88ed3f3fe7ce3f3f7698da3f3f88ed3f3fe7ce3f3f7642
EUC-JP 俑??逸??醫??v俑??逸??醫??vB 110100001101110000111111001111111011000011101111001111110011111111101110110100000011111100111111011101101101000011011100001111110011111110110000111011110011111100111111111011101101000000111111001111110111011001000010 d0dc3f3fb0ef3f3feed03f3f76d0dc3f3fb0ef3f3feed03f3f7642
UTF-8 俑앹늽逸뷸뇻醫딅츉v俑앹늽逸뷸뇻醫딅츉vB 111001001011111110010001111011001001010110111001111010111000101010111101111010011000000010111000111010111011011110111000111010111000011110111011111010011000011010101011111010111001010010000101111011001011100010001001011101101110010010111111100100011110110010010101101110011110101110001010101111011110100110000000101110001110101110110111101110001110101110000111101110111110100110000110101010111110101110010100100001011110110010111000100010010111011001000010 e4bf91ec95b9eb8abde980b8ebb7b8eb87bbe986abeb9485ecb88976e4bf91ec95b9eb8abde980b8ebb7b8eb87bbe986abeb9485ecb8897642
UHC 俑앹늽逸뷸뇻醫딅츉v俑앹늽逸뷸뇻醫딅츉vB 111010011011010110011101111011001000100010000110111011001110111110111010111001101011010010100111111011001010001010001010111010111010111010000101011101101110100110110101100111011110110010001000100001101110110011101111101110101110011010110100101001111110110010100010100010101110101110101110100001010111011001000010 e9b59dec8886ecefbae6b4a7eca28aebae8576e9b59dec8886ecefbae6b4a7eca28aebae857642

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
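
For readers who want to reproduce this kind of table offline, here is a minimal Python sketch of the conversion. It is not the code behind this page. The codec names in CHARSETS are assumed mappings from the table labels to Python's standard codecs (cp932 for SJIS-WIN, cp949 for UHC), and the helper name to_bit_strings is made up for this example. Characters a charset cannot represent are replaced with "?" (byte 3f), which appears consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced by "?" (0x3f) instead of
    raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a short sample string.
sample = "\u3042B"  # HIRAGANA LETTER A followed by "B"
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings(sample, codec)
    print(label, binary, hexadecimal)
```

Passing errors="replace" keeps every row printable even when the input contains characters outside a charset's repertoire; use errors="strict" instead if an unencodable character should be reported as an error.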