To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 瓦ゅ?瓮??節?n}瓦ゅ?瓮??節?n{^ 1000101010100010100000101110001100111111111000010100010000111111001111111001000011011111001111110110111001111101100010101010001010000010111000110011111111100001010001000011111100111111100100001101111100111111011011100111101101011110 8aa282e33fe1443f3f90df3f6e7d8aa282e33fe1443f3f90df3f6e7b5e
EUC-JP 瓦ゅ?瓮??節?n}瓦ゅ?瓮??節?n{^ 1011010010100100101001001110010100111111111000011010010100111111001111111100000011100001001111110110111001111101101101001010010010100100111001010011111111100001101001010011111100111111110000001110000100111111011011100111101101011110 b4a4a4e53fe1a53f3fc0e13f6e7db4a4a4e53fe1a53f3fc0e13f6e7b5e
UTF-8 瓦ゅ쩂瓮앶굤節쮗n}瓦ゅ쩂瓮앶굤節쮗n{^ 1110011110010011101001101110001110000010100001011110110010101001100000101110011110010011101011101110110010010101101101101110101010110101101001001110011110101111100000001110110010101110100101110110111001111101111001111001001110100110111000111000001010000101111011001010100110000010111001111001001110101110111011001001010110110110111010101011010110100100111001111010111110000000111011001010111010010111011011100111101101011110 e793a6e38285eca982e793aeec95b6eab5a4e7af80ecae976e7de793a6e38285eca982e793aeec95b6eab5a4e7af80ecae976e7b5e
UHC 瓦ゅ쩂瓮앶굤節쮗n}瓦ゅ쩂瓮앶굤節쮗n{^ 11101000101111111010101011100101101001001001110011101000101101111001110111101001100000101000101011101111101111011010100001101111011011100111110111101000101111111010101011100101101001001001110011101000101101111001110111101001100000101000101011101111101111011010100001101111011011100111101101011110 e8bfaae5a49ce8b79de9828aefbda86f6e7de8bfaae5a49ce8b79de9828aefbda86f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)