What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑?????蹂?????異??遺??孃??弛 10010100101010000011111100111111001111110011111100111111111001101111100000111111001111110011111100111111001111111000100011011001001111110011111110001000111000100011111100111111100110110110111100111111001111111001001001101111 94a83f3f3f3f3fe6f83f3f3f3f3f88d93f3f88e23f3f9b6f3f3f926f
EUC-JP 畑?????蹂?????異??遺??孃??弛 11001000101010100011111100111111001111110011111100111111111011001111101000111111001111110011111100111111001111111011000011011011001111110011111110110000111001000011111100111111110101011101000000111111001111111100001111010000 c8aa3f3f3f3f3fecfa3f3f3f3f3fb0db3f3fb0e43f3fd5d03f3fc3d0
UTF-8 畑띕끂理먬뜮蹂욎뒴烈쒕굞異녑쫩遺듬짋孃뉖톪弛 111001111001010110010001111010111001110110010101111010111000000110000010111011111010011110100100111010111010100010101100111010111001110010101110111010001011100110000010111011001001101010001110111010111001001010110100111011111010011010011111111011001001001010010101111010101011010110011110111001111001010110110000111010111000010110010001111011001010101110101001111010011000000110111010111010111001001110101100111011001010011110001011111001011010110110000011111010111000100110010110111011011000011010101010111001011011110010011011 e79591eb9d95eb8182efa7a4eba8aceb9caee8b982ec9a8eeb92b4efa69fec9295eab59ee795b0eb8591ecaba9e981baeb93aceca78be5ad83eb8996ed86aae5bc9b
UHC 畑띕끂理먬뜮蹂욎뒴烈쒕굞異녑쫩遺듬짋孃뉖톪弛 1110111110100101101101101110101110000101101110001110110010110101100100001110100110001101101011101110101110110011100111101110110010001010101011011110011011101111100111001110101110000010100001101110110010110110101100111110010110100110100000101110101110110110101101011110101110100011100101111110010110111110100001111110101110110111100000101110110010101100 efa5b6eb85b8ecb590e98daeebb39eec8aade6ef9ceb8286ecb6b3e5a682ebb6b5eba397e5be87ebb782ecac

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
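
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the names CHARSETS, encode_row, and convert are hypothetical, and the mapping from the table's labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\u7551" is the first character of the sample input in the table.
    for label, (bits, hex_string) in convert("\u7551").items():
        print(label, bits, hex_string)
```

For the first character of the sample input this should give 94a8 for SJIS-WIN, c8aa for EUC-JP, and e79591 for UTF-8, matching the leading bytes of the corresponding rows above.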