What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??謙?謙?謙?介 00111111001111111000110010101010001111111000110010101010001111111000110010101010001111111000100111101110 3f3f8caa3f8caa3f8caa3f89ee
EUC-JP ??謙?謙?謙?介 00111111001111111011100010101100001111111011100010101100001111111011100010101100001111111011001011110000 3f3fb8ac3fb8ac3fb8ac3fb2f0
UTF-8 렻旽謙뵀謙旽謙뵀介 111010111010000010111011111001101001011110111101111010001010110010011001111010111011010110000000111010001010110010011001111001101001011110111101111010001010110010011001111010111011010110000000111001001011101110001011 eba0bbe697bde8ac99ebb580e8ac99e697bde8ac99ebb580e4bb8b
UHC 렻旽謙뵀謙旽謙뵀介 100011101100001111010100110001011100110011000101101110101100010111001100110001011101010011000101110011001100010110111010110001011100101110111111 8ec3d4c5ccc5bac5ccc5d4c5ccc5bac5cbbf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in a row marks a character that the charset cannot represent.
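
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions about which Python codecs correspond to the charsets above, and encode_rows is a hypothetical helper name. Characters a charset cannot represent are replaced with "?", which is an assumption based on the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_rows("謙"):
        print(label, binary, hexadecimal)
```

Run against a single character such as 謙, the UTF-8, SJIS-WIN and EUC-JP rows should agree with the corresponding byte sequences in the table (e8ac99, 8caa and b8ac), while ISO-8859-1 falls back to 3f.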