To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烏?????汲艤℡?碎??瑤??媛??節??^ 10001001010001110011111100111111001111110011111100111111100010111000001011100100011111101000011110000100001111111110000111101010001111110011111111101010101000100011111100111111100101010101000100111111001111111001000011011111001111110011111101011110 89473f3f3f3f3f8b82e47e87843fe1ea3f3feaa23f3f95513f3f90df3f3f5e
EUC-JP 烏?????汲艤??碎??瑤??媛??節??^ 101100011010100000111111001111110011111100111111001111111011010111100010111001111101111100111111001111111110001011101100001111110011111111110100101001000011111100111111110010011011001000111111001111111100000011100001001111110011111101011110 b1a83f3f3f3f3fb5e2e7df3f3fe2ec3f3ff4a43f3fc9b23f3fc0e13f3f5e
UTF-8 烏띾슣劉ㅵ첎汲艤℡뇺碎좉설瑤노슡媛붻돳節뚮솦^ 11100111100000111000111111101011100111011011111011101100100010101010001111101111101001111000011111100011100001011011010111101100101100101000111011100110101100011011001011101000100010011010010011100010100001001010000111101011100001111011101011100111101000101000111011101100101000101000100111101100100001001010010011100111100100011010010011101011100001011011100011101100100010101010000111100101101010101001101111101011101101101011101111101011100011111011001111100111101011111000000011101011100110101010111011101100100001101010011001011110 e7838feb9dbeec8aa3efa787e385b5ecb28ee6b1b2e889a4e284a1eb87bae7a28eeca289ec84a4e791a4eb85b8ec8aa1e5aa9bebb6bbeb8fb3e7af80eb9aaeec86a65e
UHC 烏띾슣劉ㅵ첎汲艤℡뇺碎좉설瑤노슡媛붻돳節뚮솦^ 111010001010000110001101111010111001101010101111111010101110010110100100111001011010101010011011110100001110001111101011111110101010001011100101100001111001110111100001111011111010000011101010101111001011001111101000111111011011001111101011100110101010110111101010101100001001010011101000100010011011011011101111101111011000110011101011100110011001111101011110 e8a18deb9aafeae5a4e5aa9bd0e3ebfaa2e5879de1efa0eabcb3e8fdb3eb9aadeab094e889b6efbd8ceb999f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)