Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?急竊??儀??畏??泣?9蚓 0011111100111111001111111000101110000011100000011010100000111111100010110111110111100010100001100011111100111111100010110101011000111111001111111000100011011000001111110011111110001011100000110011111110000010010110001110010101101101 3f3f3f8b8381a83f8b7de2863f3f8b563f3f88d83f3f8b833f8258e56d
EUC-JP ???泣→?急竊??儀??畏??泣?9蚓 0011111100111111001111111011010111100011101000101010101000111111101101011101111011100011111001100011111100111111101101011011011100111111001111111011000011011010001111110011111110110101111000110011111110100011101110011110100111001110 3f3f3fb5e3a2aa3fb5dee3e63f3fb5b73f3fb0da3f3fb5e33fa3b9e9ce
UTF-8 捻꿔끇泣→쨫急竊숁룄儀뽯뤊畏브퀣泣쇿9蚓 111011111010011010100100111010101011111110010100111010111000000110000111111001101011001110100011111000101000011010010010111011001010100010101011111001101000000010100101111001111010101110001010111011001000100010000001111010111010001110000100111001011000010010000000111010111011110110101111111010111010010010001010111001111001010110001111111010111011100010001100111011011000000010100011111001101011001110100011111011001000011110111111111011111011110010011001111010001001101010010011 efa6a4eabf94eb8187e6b3a3e28692eca8abe680a5e7ab8aec8881eba384e58480ebbdafeba48ae7958febb88ced80a3e6b3a3ec87bfefbc99e89a93
UHC 捻꿔끇泣→쨫急竊숁룄儀뽯뤊畏브퀣泣쇿9蚓 11100110111101111011001011100011100001011011101111101011111010001010000111100110101001001000010111010000111000011110111110111100100110011110011010001111100001001110101111110000100101101110101110001111101110101110100011100110101110101110101010110011100101111110101111101000100110011110010110100011101110011110110011100010 e6f7b2e385bbebe8a1e6a485d0e1efbc99e68f84ebf096eb8fbae8e6baeab397ebe899e5a3b9ece2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
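
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page, only an illustration under stated assumptions: the helper name `to_bits` and the `CHARSETS` mapping are hypothetical, Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC, and characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what produces the runs of 3f in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


if __name__ == "__main__":
    sample = "泣"  # one of the characters that appears in the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the sample character the hexadecimal column should agree with the table rows above: 8b83 for SJIS-Win, b5e3 for EUC-JP, and e6b3a3 for UTF-8, while ISO-8859-1 falls back to 3f because it has no code for the character.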