To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳????ぜ油?????誼?┐怨??亦?? 10001010011110000011111100111111001111110011111110000010101110101001011011111011001111110011111100111111001111110011111110001011011000100011111110000100101000101000100110000101001111110011111110010110100100100011111100111111 8a783f3f3f3f82ba96fb3f3f3f3f3f8b623f84a289853f3f96923f3f
EUC-JP 岳??堉?ぜ油????Ŋ誼?┐怨??亦?? 1011001111011001001111110011111110001111101101111111110100111111101001001011110011001100111111010011111100111111001111110011111110001111101010011010101110110101110000110011111110101000101001001011000111100101001111110011111111001011111100100011111100111111 b3d93f3f8fb7fd3fa4bcccfd3f3f3f3f8fa9abb5c33fa8a4b1e53f3fcbf23f3f
UTF-8 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼뷂┐怨쀬삏亦낅컙 1110010110110010101100111110101110101100100100101110101110111001100110001110010110100000100010011110101110110110101110111110001110000001100111001110011010110010101110011110101110110000101110001110110010100000101000011110100110001010101100111110110010010110100111001100010110001010111010001010101010111100111010111011011110000010111000101001010010010000111001101000000010101000111011001000000010101100111011001000001010001111111001001011101010100110111010111000001010000101111011001011101110011001 e5b2b3ebac92ebb998e5a089ebb6bbe3819ce6b2b9ebb0b8eca0a1e98ab3ec969cc58ae8aabcebb782e29490e680a8ec80acec828fe4baa6eb8285ecbb99
UHC 岳묒빘堉붻ぜ油밸젡銳얜Ŋ誼뷂┐怨쀬삏亦낅컙 111001001011111110010001111011001001010110111001111010111011110010010100111010001010101010111100111010101111101010111001111010111010000010011010111001111110010110111110111010111010100010101111111010111111111010010100111011111010011010100100111010101011001110010111111011001001100010010110111001101011001010000101111010111011000010000100 e4bf91ec95b9ebbc94e8aabceafab9eba09ae7e5beeba8afebfe94efa6a4eab397ec9896e6b285ebb084

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
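
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The helper name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
def encode_to_bits(text, codec, errors="replace"):
    """Encode text with the given codec and return (binary string, hex string).

    Unencodable characters become '?' (0x3f) when errors="replace".
    """
    raw = text.encode(codec, errors=errors)
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "岳"  # first character of the table's input
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

For the single character 岳, this sketch yields `e5b2b3` for UTF-8, `8a78` for SJIS-WIN, `b3d9` for EUC-JP, and `3f` for ISO-8859-1, matching the leading bytes of the corresponding table rows.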