To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??韋????ギ嚥△?寃??懿??鴉 100010100111100000111111001111111110100011101000001111110011111100111111001111111000001101001101100110101000101110000001101000100011111110011011100000110011111100111111100111001111001000111111001111111110100111101011 8a783f3fe8e83f3f3f3f834d9a8b81a23f9b833f3f9cf23f3fe9eb
EUC-JP 岳??韋????ギ嚥△?寃??懿??鴉 101100111101100100111111001111111111000011101010001111110011111100111111001111111010010110101110110100111110101110100010101001000011111111010101111000110011111100111111110110001111010000111111001111111111001011101101 b3d93f3ff0ea3f3f3f3fa5aed3eba2a43fd5e33f3fd8f43f3ff2ed
UTF-8 岳묒빖韋뉏펺栒우ギ嚥△뫂寃㏛퓩懿몃걞鴉 111001011011001010110011111010111010110010010010111010111011100110010110111010011001111110001011111010111000100110001111111011011000111010111010111001101010000010010010111011001001101010110000111000111000001010101110111001011001101010100101111000101001011010110011111010111010101110000010111001011010111110000011111000111000111110011011111011011001001110101001111001101000011110111111111010111010101010000011111010101011000110011110111010011011010010001001 e5b2b3ebac92ebb996e99f8beb898fed8ebae6a092ec9ab0e382aee59aa5e296b3ebab82e5af83e38f9bed93a9e687bfebaa83eab19ee9b489
UHC 岳묒빖韋뉏펺栒우ギ嚥△뫂寃㏛퓩懿몃걞鴉 1110010010111111100100011110110010010101101110001110101011011111100001111110010010111100100010101110001011100011101111111110110010101011101011101110011010111111101000011110001010010001101001101110101010110010101001111110010010111111100100011110101111110011101110001110101110000001100001111110010010111100 e4bf91ec95b8eadf87e4bc8ae2e3bfecabaee6bfa1e291a6eab2a7e4bf91ebf3b8eb8187e4bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)