To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 吟??俺?■蜃?俺? 100010111110000100111111001111111000100110110100001111111000000110100001111001011000011100111111100010011011010000111111 8be13f3f89b43f81a1e5873f89b43f
EUC-JP 吟??俺?■蜃?俺? 101101101110001100111111001111111011001010110110001111111010001010100011111010011110011100111111101100101011011000111111 b6e33f3fb2b63fa2a3e9e73fb2b63f
UTF-8 吟㏘짠俺얕■蜃렏俺슴 111001011001000010011111111000111000111110011000111011001010011110100000111001001011111110111010111011001001011010010101111000101001011010100000111010001001110010000011111010111010000010001111111001001011111110111010111011001000101010110100 e5909fe38f98eca7a0e4bfbaec9695e296a0e89c83eba08fe4bfbaec8ab4
UHC 吟㏘짠俺얕■蜃렏俺슴 1110101111100001101000101110010011000010101001111110010111101111101111101110100010100001111000011110001111110001100011101010010111100101111011111011110110111111 ebe1a2e4c2a7e5efbee8a1e1e3f18ea5e5efbdbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)