What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 癲щ?揄??蟻??v癲щ?揄??蟻??vB 1110000110011111100001001000101100111111100111011000100100111111001111111000101101100001001111110011111101110110111000011001111110000100100010110011111110011101100010010011111100111111100010110110000100111111001111110111011001000010 e19f848b3f9d893f3f8b613f3f76e19f848b3f9d893f3f8b613f3f7642
EUC-JP 癲щ?揄??蟻??v癲щ?揄??蟻??vB 1110001010100001101001111110101100111111110110011110100100111111001111111011010111000010001111110011111101110110111000101010000110100111111010110011111111011001111010010011111100111111101101011100001000111111001111110111011001000010 e2a1a7eb3fd9e93f3fb5c23f3f76e2a1a7eb3fd9e93f3fb5c23f3f7642
UTF-8 癲щ끝揄끿독蟻앸솵v癲щ끝揄끿독蟻앸솵vB 11100111100110011011001011010001100010011110101110000001100111011110011010001111100001001110101110000001101111111110101110001111100001011110100010011111101110111110110010010101101110001110110010000110101101010111011011100111100110011011001011010001100010011110101110000001100111011110011010001111100001001110101110000001101111111110101110001111100001011110100010011111101110111110110010010101101110001110110010000110101101010111011001000010 e799b2d189eb819de68f84eb81bfeb8f85e89fbbec95b8ec86b576e799b2d189eb819de68f84eb81bfeb8f85e89fbbec95b8ec86b57642
UHC 癲щ끝揄끿독蟻앸솵v癲щ끝揄끿독蟻앸솵vB 111011111010011010101100111010111011001110100001111010101111000110000101111001111011010110110110111010111111110010011101111010111001100110101010011101101110111110100110101011001110101110110011101000011110101011110001100001011110011110110101101101101110101111111100100111011110101110011001101010100111011001000010 efa6acebb3a1eaf185e7b5b6ebfc9deb99aa76efa6acebb3a1eaf185e7b5b6ebfc9deb99aa7642

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
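
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-win, cp949 for UHC) is an assumption, and the helper names bit_strings and conversion_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which would be consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These pairings are an illustration; the page's own converter may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def conversion_table(text):
    """Return one (label, binary, hex) row per charset in CODECS."""
    return [(label, *bit_strings(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("vB"):
        print(label, binary, hexadecimal)
```

For ASCII-only input such as "vB", every charset listed here produces the same bytes (76 42). The rows only diverge once the input contains characters outside ASCII.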