To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??揖??喩??? 10001000101000110011111100111111100101110100101100111111001111111001101001100111001111110011111100111111 88a33f3f974b3f3f9a673f3f3f
EUC-JP 哀??揖??喩??? 10110000101001010011111100111111110011011010110000111111001111111101001111001000001111110011111100111111 b0a53f3fcdac3f3fd3c83f3f3f
UTF-8 哀노콈揖욕죰喩쏆뒌閱 111001011001001110000000111010111000010110111000111011001011110110001000111001101000111110010110111011001001101010010101111011001010001110110000111001011001011010101001111011001000111110000110111010111001001010001100111010011001011010110001 e59380eb85b8ecbd88e68f96ec9a95eca3b0e596a9ec8f86eb928ce996b1
UHC 哀노콈揖욕죰喩쏆뒌閱 1110010011101110101100111110101110110001100001001110101111100111101111111110010110100001100010111110101011100111100110111110110010001010100010011110011011110011 e4eeb3ebb184ebe7bfe5a18beae79bec8a89e6f3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)