What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳????ぜ違??閻??議??蹂λ????? 1000101001111000001111110011111100111111001111111000001010111010100010001110000100111111001111111110100010000101001111110011111110001011011000110011111100111111111001101111100010000011110010010011111100111111001111110011111100111111 8a783f3f3f3f82ba88e13f3fe8853f3f8b633f3fe6f883c93f3f3f3f3f
EUC-JP 岳??靷?ぜ違??閻??議??蹂λ????? 10110011110110010011111100111111100011111110011110111101001111111010010010111100101100001110001100111111001111111110111111100101001111110011111110110101110001000011111100111111111011001111101010100110110010110011111100111111001111110011111100111111 b3d93f3f8fe7bd3fa4bcb0e33f3fefe53f3fb5c43f3fecfaa6cb3f3f3f3f3f
UTF-8 岳묒빖靷뽬ぜ違먥뵺閻롢끉議믣꽱蹂λ뢿驪낅뎿柳 1110010110110010101100111110101110101100100100101110101110111001100101101110100110011101101101111110101110111101101011001110001110000001100111001110100110000001100101011110101110101000101001011110101110110101101110101110100110010110101110111110101110100001101000101110101110000001100010011110100010101101101100001110101110101111101000111110101010111101101100011110100010111001100000101100111010111011111010111010001010111111111011111010011010000111111010111000001010000101111010111000111010111111111011111010011110001001 e5b2b3ebac92ebb996e99db7ebbdace3819ce98195eba8a5ebb5bae996bbeba1a2eb8189e8adb0ebafa3eabdb1e8b982cebbeba2bfefa687eb8285eb8ebfefa789
UHC 岳묒빖靷뽬ぜ違먥뵺閻롢끉議믣꽱蹂λ뢿驪낅뎿柳 1110010010111111100100011110110010010101101110001110110011100110100101101110100010101010101111001110101011011110100100001110001010010100101110001110011110100010100011101110001110000101101111001110110010100001100100101110010110000100101111001110101110110011101001011110101110001111100000101110011010101111100001011110101110001001100100101110101011110111 e4bf91ec95b8ece696e8aabceade90e294b8e7a28ee385bceca192e584bcebb3a5eb8f82e6af85eb8992eaf7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
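
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. It assumes that Python's standard codecs "iso-8859-1", "cp932", "euc_jp", "utf-8" and "cp949" correspond to the five charsets listed, and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table. The names CHARSETS, to_bit_strings and convert are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Because the result depends only on the Unicode text passed in, a table full of "?" or of unexpected Hangul usually means the input was decoded with the wrong charset before it was converted.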