To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳????ぜ油?????誼??怨??筌?? 100010100111100000111111001111110011111100111111100000101011101010010110111110110011111100111111001111110011111100111111100010110110001000111111001111111000100110000101001111110011111111100010101000110011111100111111 8a783f3f3f3f82ba96fb3f3f3f3f3f8b623f3f89853f3fe2a33f3f
EUC-JP 岳??堉?ぜ油????Ŋ誼??怨??筌?? 10110011110110010011111100111111100011111011011111111101001111111010010010111100110011001111110100111111001111110011111100111111100011111010100110101011101101011100001100111111001111111011000111100101001111110011111111100100101001010011111100111111 b3d93f3f8fb7fd3fa4bcccfd3f3f3f3f8fa9abb5c33f3fb1e53f3fe4a53f3f
UTF-8 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼붼쨹怨뺤벁筌ㅼ뀟 1110010110110010101100111110101110101100100100101110101110111001100110001110010110100000100010011110101110110110101110111110001110000001100111001110011010110010101110011110101110110000101110001110110010100000101000011110100110001010101100111110110010010110100111001100010110001010111010001010101010111100111010111011011010111100111011001010100010111001111001101000000010101000111010111011101010100100111010111011001010000001111001111010110110001100111000111000010110111100111010111000000010011111 e5b2b3ebac92ebb998e5a089ebb6bbe3819ce6b2b9ebb0b8eca0a1e98ab3ec969cc58ae8aabcebb6bceca8b9e680a8ebbaa4ebb281e7ad8ce385bceb809f
UHC 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼붼쨹怨뺤벁筌ㅼ뀟 111001001011111110010001111011001001010110111001111010111011110010010100111010001010101010111100111010101111101010111001111010111010000010011010111001111110010110111110111010111010100010101111111010111111111010010100111010011010010010010011111010101011001110010101111011001001001110100111111011111010011110100100111011001000010110010110 e4bf91ec95b9ebbc94e8aabceafab9eba09ae7e5beeba8afebfe94e9a493eab395ec93a7efa7a4ec8596

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)