To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甑?耳?泣紗??滓?甑?耳?泣紗??滓?^ 10001101100110010011111110001110101010000011111110001011100000111000111011010001001111110011111110011111111001100011111110001101100110010011111110001110101010000011111110001011100000111000111011010001001111110011111110011111111001100011111101011110 8d993f8ea83f8b838ed13f3f9fe63f8d993f8ea83f8b838ed13f3f9fe63f5e
EUC-JP 甑?耳?泣紗??滓?甑?耳?泣紗??滓?^ 10111001111110010011111110111100101010100011111110110101111000111011110011010011001111110011111111011110111010000011111110111001111110010011111110111100101010100011111110110101111000111011110011010011001111110011111111011110111010000011111101011110 b9f93fbcaa3fb5e3bcd33f3fdee83fb9f93fbcaa3fb5e3bcd33f3fdee83f5e
UTF-8 甑렏耳렩泣紗렪렭滓쁩甑렏耳렩泣紗렪렭滓쁠^ 11100111100101001001000111101011101000001000111111101000100000001011001111101011101000001010100111100110101100111010001111100111101101001001011111101011101000001010101011101011101000001010110111100110101110111001001111101100100000011010100111100111100101001001000111101011101000001000111111101000100000001011001111101011101000001010100111100110101100111010001111100111101101001001011111101011101000001010101011101011101000001010110111100110101110111001001111101100100000011010000001011110 e79491eba08fe880b3eba0a9e6b3a3e7b497eba0aaeba0ade6bb93ec81a9e79491eba08fe880b3eba0a9e6b3a3e7b497eba0aaeba0ade6bb93ec81a05e
UHC 甑렏耳렩泣紗렪렭滓쁩甑렏耳렩泣紗렪렭滓쁠^ 1111000111110111100011101010010111101100101111001000111010110111111010111110100011011110111010011000111010111000100011101011101011101110101010111011101111011110111100011111011110001110101001011110110010111100100011101011011111101011111010001101111011101001100011101011100010001110101110101110111010101011101110111101110001011110 f1f78ea5ecbc8eb7ebe8dee98eb88ebaeeabbbdef1f78ea5ecbc8eb7ebe8dee98eb88ebaeeabbbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)