Which bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??揖????泣???⑥?掩??泣?┸ 100101101110100100111111001111111001011101001011001111110011111100111111001111111000101110000011001111110011111100111111100001110100010100111111100010011000011000111111001111111000101110000011001111111000010010111101 96e93f3f974b3f3f3f3f8b833f3f3f87453f89863f3f8b833f84bd
EUC-JP 夜??揖????泣?ˇ洹??掩??泣?┸ 110011001110101100111111001111111100110110101100001111110011111100111111001111111011010111100011001111111000111110100010101100001000111111000111101110100011111100111111101100011110011000111111001111111011010111100011001111111010100010111111 cceb3f3fcdac3f3f3f3fb5e33f8fa2b08fc7ba3f3fb1e63f3fb5e33fa8bf
UTF-8 夜껊씛揖퀱嶪용뜉泣먩ˇ洹⑥돖掩뽰룊泣덌┸ 1110010110100100100111001110101010111011100010101110110010010100100110111110011010001111100101101110110110000000101100011110010110110110101010101110110010011010101010011110101110011100100010011110011010110011101000111110101110101000101010011100101110000111111001101011010010111001111000101001000110100101111010111000111110010110111001101000111010101001111010111011110110110000111010111010001110001010111001101011001110100011111010111000110110001100111000101001010010111000 e5a49ceabb8aec949be68f96ed80b1e5b6aaec9aa9eb9c89e6b3a3eba8a9cb87e6b4b9e291a5eb8f96e68ea9ebbdb0eba38ae6b3a3eb8d8ce294b8
UHC 夜껊씛揖퀱嶪용뜉泣먩ˇ洹⑥돖掩뽰룊泣덌┸ 11100101101010001000001111101011100111011011000011101011111001111011010001000100111001011111010110111111111010111000110110001100111010111110100010010000111001101010001010100111111010101011011110101000111011001000100110100000111001011111001110010110111011001000111110001001111010111110100010001000111011111010011010111111 e5a883eb9db0ebe7b444e5f5bfeb8d8cebe890e6a2a7eab7a8ec89a0e5f396ec8f89ebe888efa6bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
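
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the tool's actual implementation. The function name `encode_table` and the codec names are assumptions: `cp932` stands in for SJIS-WIN, `euc_jp` for EUC-JP, and `cp949` for UHC. Characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the ISO-8859-1 row above.

```python
# Hypothetical helper: encode one string in several charsets and
# return the result as binary and hexadecimal bit strings.
DEFAULT_CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=DEFAULT_CHARSETS):
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        # Each byte becomes exactly 8 binary digits, zero-padded
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for charset, bits, hexstr in encode_table("夜"):
        print(charset, bits, hexstr)
```

Printing both forms side by side makes it easy to see that the hexadecimal column is the binary column read four bits at a time.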