What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN | ?罕漿?枇?瀨??^ | 0011111111100011101001011001111111110111001111111001010011111000001111111111101101010000001111110011111101011110 | 3fe3a59ff73f94f83ffb503f3f5e
EUC-JP | ?罕漿?枇????^ | 00111111111001101010011111011110111110010011111111001000111110100011111100111111001111110011111101011110 | 3fe6a7def93fc8fa3f3f3f3f5e
UTF-8 | 뤋罕漿㈎枇샘瀨렔외^ | 11101011101001001000101111100111101111011001010111100110101111001011111111100011100010001000111011100110100111101000011111101100100000111001100011100111100000001010100011101011101000001001010011101100100110011011100001011110 | eba48be7bd95e6bcbfe3888ee69e87ec8398e780a8eba094ec99b85e
UHC | 뤋罕漿㈎枇샘瀨렔외^ | 10001111101110111111100111010110111011011110110010101001101111111101110111101101101110111111100111010110111011101000111010101001101111111101110001011110 | 8fbbf9d6edeca9bfddedbbf9d6ee8ea9bfdc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (0x3f) in that charset's row.
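
A conversion like the one in the table can be roughly reproduced offline. The sketch below is not this tool's implementation. It assumes that Python's standard codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` are close equivalents of the charsets listed, and that unencodable characters should become "?" as they do in the table. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical, chosen for illustration only.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are close equivalents, not guaranteed byte-for-byte matches of the tool.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by the table's label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("한^").items():
        print(label, bits, hexa)
```

Results for characters outside the common repertoire may differ from the tool's output, since vendor extensions vary between implementations of the same nominal charset.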