To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | ?щ??や?姙 | 00111111100001001000101100111111001111111000001011100010001111111001101101001011 | 3f848b3f3f82e23f9b4b |
| EUC-JP | ?щł?や?姙 | 001111111010011111101011100011111010100111001000001111111010010011100100001111111101010110101100 | 3fa7eb8fa9c83fa4e43fd5ac |
| UTF-8 | 寧щł溜や옇姙 | 11101111101001101010101011010001100010011100010110000010111011111010011110001011111000111000001010000100111011001001100010000111111001011010011110011001 | efa6aad189c582efa78be38284ec9887e5a799 |
| UHC | 寧щł溜や옇姙 | 1110011110101100101011001110101110101001101010011110101011111110101010101110010010111111101110001110110011110101 | e7acaceba9a9eafeaae4bfb8ecf5 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
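
A conversion like the one in the table above can be approximated in Python. This is a minimal sketch, not the page's actual implementation. The helper names `encode_row` and `convert` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), as in the table. Python's codecs may differ from the page's converter in which characters they can map, so the legacy-charset rows may not match byte for byte.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec.

    Unencodable characters are replaced with b'?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # The seven sample characters, written as escapes so that the
    # code points are unambiguous (taken from the UTF-8 row's bytes).
    sample = "\uf9aa\u0449\u0142\uf9cb\u3084\uc607\u59d9"
    for label, (bits, hex_string) in convert(sample).items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.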