What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ?щ?預???k? 001111111000010010001011001111111001011101100001001111110011111100111111100000101000101100111111 3f848b3f97613f3f3f828b3f
EUC-JP ?щ?預???k? 001111111010011111101011001111111100110111000010001111110011111100111111101000111110101100111111 3fa7eb3fcdc23f3f3fa3eb3f
UTF-8 寧щ젦預뗥낡溜k젶 1110111110100110101010101101000110001001111011001010000010100110111010011010000010010000111010111001011110100101111010111000001010100001111011111010011110001011111011111011110110001011111011001010000010110110 efa6aad189eca0a6e9a090eb97a5eb82a1efa78befbd8beca0b6
UHC 寧щ젦預뗥낡溜k젶 111001111010110010101100111010111010000010011110111001111110100010001011111001011011001110110000111010101111111010100011111010111010000010101010 e7acaceba09ee7e88be5b3b0eafea3eba0aa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
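
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-Win and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above shows.

```python
# Hypothetical helper: encode a string in several charsets and report
# the resulting bytes as a binary string and a hexadecimal string.
# Assumption: Python codec names stand in for the page's charset names
# (cp932 for SJIS-Win, cp949 for UHC).
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("あ"):
        print(name, bits, hexa)
```

Results may differ from the table for some characters, since each library ships its own mapping tables for the legacy charsets.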