To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??竊??諛??嚴щ?違??恂レ?繹 11111010110100000011111100111111111000101000011000111111001111111110011010000111001111110011111110011010100011101000010010001011001111111000100011100001001111110011111110011100100101101000001110001100001111111110001110001000 fad03f3fe2863f3fe6873f3f9a8e848b3f88e13f3f9c96838c3fe388
EUC-JP ???竊??諛??嚴щ?違??恂レ?繹 001111110011111100111111111000111110011000111111001111111110101111100111001111110011111111010011111011101010011111101011001111111011000011100011001111110011111111010111111101101010010111101100001111111110010111101000 3f3f3fe3e63f3febe73f3fd3eea7eb3fb0e33f3fd7f6a5ec3fe5e8
UTF-8 昻뉗떜竊숂몭諛⑸짎嚴щ씞違뗰쫳恂レ챺繹 1110011010011000101110111110101110001001100101111110101110010110100111001110011110101011100010101110110010001000100000101110101110101010101011011110100010101011100110111110001010010001101110001110110010100111100011101110010110011010101101001101000110001001111011001001010010011110111010011000000110010101111010111001011110110000111011001010101110110011111001101000000110000010111000111000001110101100111011001011000110111010111001111011100110111001 e698bbeb8997eb969ce7ab8aec8882ebaaade8ab9be291b8eca78ee59ab4d189ec949ee98195eb97b0ecabb3e68182e383acecb1bae7b9b9
UHC 昻뉗떜竊숂몭諛⑸짎嚴щ씞違뗰쫳恂レ챺繹 1110010011101001100001111110110010001011101100101110111110111100100110011110011110010001100101111110101110110000101010011110101110100011100110101110010111110001101011001110101110011101101100101110101011011110100010111110111110100110100010111110001011100001101010111110110010101010100001111110011010111010 e4e987ec8bb2efbc99e79197ebb0a9eba39ae5f1aceb9db2eade8befa68be2e1abecaa87e6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)