To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 岳??濡?????z岳??濡?????zB 10001010011110000011111100111111100101000100011100111111001111110011111100111111001111110111101010001010011110000011111100111111100101000100011100111111001111110011111100111111001111110111101001000010 8a783f3f94473f3f3f3f3f7a8a783f3f94473f3f3f3f3f7a42
EUC-JP 岳??濡?????z岳??濡?????zB 10110011110110010011111100111111110001111010100000111111001111110011111100111111001111110111101010110011110110010011111100111111110001111010100000111111001111110011111100111111001111110111101001000010 b3d93f3fc7a83f3f3f3f3f7ab3d93f3fc7a83f3f3f3f3f7a42
UTF-8 岳롫젽濡뤿젽溜뷴뎄z岳롫젽濡뤿젽溜뷴뎄zB 111001011011001010110011111010111010000110101011111011001010000010111101111001101011111110100001111010111010010010111111111011001010000010111101111011111010011110001011111010111011011110110100111010111000111010000100011110101110010110110010101100111110101110100001101010111110110010100000101111011110011010111111101000011110101110100100101111111110110010100000101111011110111110100111100010111110101110110111101101001110101110001110100001000111101001000010 e5b2b3eba1abeca0bde6bfa1eba4bfeca0bdefa78bebb7b4eb8e847ae5b2b3eba1abeca0bde6bfa1eba4bfeca0bdefa78bebb7b4eb8e847a42
UHC 岳롫젽濡뤿젽溜뷴뎄z岳롫젽濡뤿젽溜뷴뎄zB 111001001011111110001110111010111010000010101111111010111010000110001111111010111010000010101111111010101111111010111010111001011011010110101100011110101110010010111111100011101110101110100000101011111110101110100001100011111110101110100000101011111110101011111110101110101110010110110101101011000111101001000010 e4bf8eeba0afeba18feba0afeafebae5b5ac7ae4bf8eeba0afeba18feba0afeafebae5b5ac7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)