Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN 壤??泣①?魏??永??揖??袁?┓gB 10011010110111110011111100111111100010111000001110000111010000000011111111101001101100000011111100111111100010010110100100111111001111111001011101001011001111110011111111100101110011010011111110000100101011010110011101000010 9adf3f3f8b8387403fe9b03f3f89693f3f974b3f3fe5cd3f84ad6742
EUC-JP 壤??泣??魏??永??揖??袁?┓gB 110101001110000100111111001111111011010111100011001111110011111111110010101100100011111100111111101100011100101000111111001111111100110110101100001111110011111111101010110011110011111110101000101011110110011101000010 d4e13f3fb5e33f3ff2b23f3fb1ca3f3fcdac3f3feacf3fa8af6742
UTF-8 壤깆쥜泣①독魏됱댋永띕굢揖먪독袁ㅼ┓gB 1110010110100011101001001110101010111001100001101110110010100101100111001110011010110011101000111110001010010001101000001110101110001111100001011110100110101101100011111110101110010000101100011110101110001100100010111110011010110000101110001110101110011101100101011110101010110101101000101110011010001111100101101110101110101000101010101110101110001111100001011110100010100010100000011110001110000101101111001110001010010100100100110110011101000010 e5a3a4eab986eca59ce6b3a3e291a0eb8f85e9ad8feb90b1eb8c8be6b0b8eb9d95eab5a2e68f96eba8aaeb8f85e8a281e385bce294936742
UHC 壤깆쥜泣①독魏됱댋永띕굢揖먪독袁ㅼ┓gB 1110010110111101101100011110110010100010100100011110101111101000101010001110011110110101101101101110101011100000100010011110110010001000101101001110011110110101101101101110101110000010100010011110101111100111100100001110011110110101101101101110101010111110101001001110110010100110101011110110011101000010 e5bdb1eca291ebe8a8e7b5b6eae089ec88b4e7b5b6eb8289ebe790e7b5b6eabea4eca6af6742
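The table above can be reproduced offline with a short script. The following is a minimal sketch in Python, not the converter's actual implementation. The codec names are assumptions about how the table's labels map to Python's standard codecs: `cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `latin-1` for ISO-8859-1, and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("gB"):
        print(label, binary, hexadecimal)
```

For the ASCII input "gB", every charset yields the same two bytes, 67 and 42, which is why each row of the table ends in 6742.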

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.