To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シ眈ミ疾執濵釈シミ疾執鄙 101111001110000110111100110100001000111010111110111100111110111110001110101101111111101101001101100011101101111110111100110100001000111010111110111100111110111110001110101101111110011110111111 bce1bcd08ebef3ef8eb7fb4d8edfbcd08ebef3ef8eb7e7bf
EUC-JP シ眈ミ疾?執濵釈シミ疾?執鄙 100011101011110011100010101111101000111011010000101111001100000000111111101111001011100110001111110010011010011010111100111000011000111010111100100011101101000010111100110000000011111110111100101110011110111011000001 8ebce2be8ed0bcc03fbcb98fc9a6bce18ebc8ed0bcc03fbcb9eec1
UTF-8 シ眈ミ疾執濵釈シミ疾執鄙 111011111011110110111100111001111001110010001000111011111011111010010000111001111001011010111110111011101000101110100010111001011001111110110111111001101011111110110101111010011000011110001000111011111011110110111100111011111011111010010000111001111001011010111110111011101000101110100010111001011001111110110111111010011000010010011001 efbdbce79c88efbe90e796beee8ba2e59fb7e6bfb5e98788efbdbcefbe90e796beee8ba2e59fb7e98499
UHC ?眈?疾?執????疾?執鄙 0011111111110111101011110011111111110010111100000011111111110010111110110011111100111111001111110011111111110010111100000011111111110010111110111101111010101001 3ff7af3ff2f03ff2fb3f3f3f3ff2f03ff2fbdea9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)