To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 揶?????沃??湲??燁??轅??掖??^ 10011101100010000011111100111111001111110011111100111111100101111000000000111111001111111001111111010001001111110011111111111011010110010011111100111111111001110111011000111111001111111001110101110100001111110011111101011110 9d883f3f3f3f3f97803f3f9fd13f3ffb593f3fe7763f3f9d743f3f5e
EUC-JP 揶?????沃??湲??燁??轅??掖??^ 1101100111101000001111110011111100111111001111110011111111001101111000000011111100111111110111101101001100111111001111111000111111001010101100110011111100111111111011011101011100111111001111111101100111010101001111110011111101011110 d9e83f3f3f3f3fcde03f3fded33f3f8fcab33f3fedd73f3fd9d53f3f5e
UTF-8 揶쏁쭅溜뽮여沃뚦쐯湲됧첎燁곷젚轅⑵챿掖볤툈^ 11100110100011111011011011101100100011111000000111101100101011011000010111101111101001111000101111101011101111011010111011101100100101111010110011100110101100101000001111101011100110101010011011101100100100001010111111100110101110011011001011101011100100001010011111101100101100101000111011100111100001111000000111101010101100111011011111101100101000001001101011101000101111011000010111100010100100011011010111101100101100011011111111100110100011101001011011101011101100111010010011101101100010001000100001011110 e68fb6ec8f81ecad85efa78bebbdaeec97ace6b283eb9aa6ec90afe6b9b2eb90a7ecb28ee78781eab3b7eca09ae8bd85e291b5ecb1bfe68e96ebb3a4ed88885e
UHC 揶쏁쭅溜뽮여沃뚦쐯湲됧첎燁곷젚轅⑵챿掖볤툈^ 11100101101010101001101111100111101001111000000111101010111111101001011011101010101111111010100111101000101010101000110011100101100111001001001111101010101110001000100111100101101010101001101111100111101001111000000111101011101000001001011011101010101111111010100111101000101010101000110011100100111110101001001111101010101110001000000101011110 e5aa9be7a781eafe96eabfa9e8aa8ce59c93eab889e5aa9be7a781eba096eabfa9e8aa8ce4fa93eab8815e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)