To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?學孩?秘林邯え?^ 00111111100110110111101110011011011101110011111110010100111010011001011111010001111001111011011010000010101001100011111101011110 3f9b7b9b773f94e997d1e7b682a63f5e
EUC-JP ?學孩?秘林邯え?^ 00111111110101011101110011010101110110000011111111001000111010111100111011010011111011101011100010100100101010000011111101011110 3fd5dcd5d83fc8ebced3eeb8a4a83f5e
UTF-8 뤋學孩㉠秘林邯え긁^ 11101011101001001000101111100101101011011011100011100101101011011010100111100011100010011010000011100111101001111001100011100110100111101001011111101001100000101010111111100011100000011000100011101010101110001000000101011110 eba48be5adb8e5ada9e389a0e7a798e69e97e982afe38188eab8815e
UHC 뤋學孩㉠秘林邯え긁^ 10001111101110111111100111001010111110101010100110101000101100011101110111111010110101111111100111001010111110111010101010101000101100011101110001011110 8fbbf9cafaa9a8b1ddfad7f9cafbaaa8b1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)