What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??魏??恂μ????鎰??攸??渦??違 111000111010000000111111001111111110100110110000001111110011111110011100100101101000001111001010001111110011111100111111001111111110100001001100001111110011111110011101101111110011111100111111100010010101000100111111001111111000100011100001 e3a03f3fe9b03f3f9c9683ca3f3f3f3fe84c3f3f9dbf3f3f89513f3f88e1
EUC-JP 罌??魏??恂μ?艅??鎰??攸??渦??違 1110011010100010001111110011111111110010101100100011111100111111110101111111011010100110110011000011111110001111110101101111110100111111001111111110111110101101001111110011111111011010110000010011111100111111101100011011001000111111001111111011000011100011 e6a23f3ff2b23f3fd7f6a6cc3f8fd6fd3f3fefad3f3fdac13f3fb1b23f3fb0e3
UTF-8 罌살슕魏섋ㄵ恂μ툛艅덈낑鎰꾣뀆攸됱돖渦긱꺇違 1110011110111101100011001110110010000010101101001110110010001010100101011110100110101101100011111110110010000100100010111110001110000100101101011110011010000001100000101100111010111100111011011000100010011011111010001000100110000101111010111000110110001000111010111000001010010001111010011000111010110000111010101011111010100011111010111000000010000110111001101001010010111000111010111001000010110001111010111000111110010110111001101011100010100110111010101011100010110001111010101011101010000111111010011000000110010101 e7bd8cec82b4ec8a95e9ad8fec848be384b5e68182cebced889be88985eb8d88eb8291e98eb0eabea3eb8086e694b8eb90b1eb8f96e6b8a6eab8b1eaba87e98195
UHC 罌살슕魏섋ㄵ恂μ툛艅덈낑鎰꾣뀆攸됱돖渦긱꺇違 1110010110100010101110111110110010011010101001001110101011100000100110001110100010100100101001011110001011100001101001011110110010111000100100101110011010101001100010001110101110110011101010011110110011110000100001001110011010000101100000101110101011110010100010011110110010001001101000001110100010111110101100011110001110000011101011101110101011011110 e5a2bbec9aa4eae098e8a4a5e2e1a5ecb892e6a988ebb3a9ecf084e68582eaf289ec89a0e8beb1e383aeeade

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
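
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping from the table's labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is assumed to match the runs of 3f visible in the table.

```python
# Table label -> Python codec name (this mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text in the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "違"  # last character of the sample input in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For the single character 違, the UTF-8 result is e98195 and the SJIS-WIN result is 88e1, which agrees with the final bytes of the corresponding rows in the table.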