To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳????ぜ油?????誼??疑?ぜ筌?? 10001010011110000011111100111111001111110011111110000010101110101001011011111011001111110011111100111111001111110011111110001011011000100011111100111111100010110101111000111111100000101011101011100010101000110011111100111111 8a783f3f3f3f82ba96fb3f3f3f3f3f8b623f3f8b5e3f82bae2a33f3f
EUC-JP 岳??堉?ぜ油????Ŋ誼??疑?ぜ筌?? 1011001111011001001111110011111110001111101101111111110100111111101001001011110011001100111111010011111100111111001111110011111110001111101010011010101110110101110000110011111100111111101101011011111100111111101001001011110011100100101001010011111100111111 b3d93f3f8fb7fd3fa4bcccfd3f3f3f3f8fa9abb5c33f3fb5bf3fa4bce4a53f3f
UTF-8 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼붹쾬疑용ぜ筌깊뀹 1110010110110010101100111110101110101100100100101110101110111001100110001110010110100000100010011110101110110110101110111110001110000001100111001110011010110010101110011110101110110000101110001110110010100000101000011110100110001010101100111110110010010110100111001100010110001010111010001010101010111100111010111011011010111001111011001011111010101100111001111001011010010001111011001001101010101001111000111000000110011100111001111010110110001100111010101011100110001010111010111000000010111001 e5b2b3ebac92ebb998e5a089ebb6bbe3819ce6b2b9ebb0b8eca0a1e98ab3ec969cc58ae8aabcebb6b9ecbeace79691ec9aa9e3819ce7ad8ceab98aeb80b9
UHC 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼붹쾬疑용ぜ筌깊뀹 111001001011111110010001111011001001010110111001111010111011110010010100111010001010101010111100111010101111101010111001111010111010000010011010111001111110010110111110111010111010100010101111111010111111111010010100111001101011001010000011111010111111011110111111111010111010101010111100111011111010011110110001111011011000010110101111 e4bf91ec95b9ebbc94e8aabceafab9eba09ae7e5beeba8afebfe94e6b283ebf7bfebaabcefa7b1ed85af

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)