Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 熬??役?Д節? 111000001001001000111111001111111001011011110000001111111000010001000100100100001101111100111111 e0923f3f96f03f844490df3f
EUC-JP 熬??役?Д節? 110111111111001000111111001111111100110011110010001111111010011110100101110000001110000100111111 dff23f3fccf23fa7a5c0e13f
UTF-8 熬긷돭役닻Д節캺 1110011110000110101011001110101010111000101101111110101110001111101011011110010110111101101110011110101110001011101110111101000010010100111001111010111110000000111011001011101010111010 e786aceab8b7eb8fade5bdb9eb8bbbd094e7af80ecbaba
UHC 熬긷돭役닻Д節캺 11101000101000101011000111100101100010011011000011100110101101011011010011101001101011001010010111101111101111011011000001011010 e8a2b1e589b0e6b5b4e9aca5efbdb05a

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
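
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page: the function name encode_rows is hypothetical, the codec names (cp932 for SJIS-Win, cp949 for UHC) are Python's own, and Python's mapping tables may differ in places from the ones this tool uses. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table above.

```python
def encode_rows(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Encode `text` in each charset; return (charset, binary string, hex string) rows.

    Assumption: Python codec names stand in for the table's labels
    (cp932 ~ SJIS-Win, cp949 ~ UHC). Unencodable characters become "?".
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        # Each byte is rendered as 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_rows("あ"):
        print(name, bits, hexa)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.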