To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????m}????????m{^ 001111110011111100111111001111110011111100111111001111110011111101101101011111010011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN ?呼ф傲?瀕??m}?呼ф傲?瀕??m{^ 0011111110001100110001001000010010000110100110001111110000111111100101010110110100111111001111110110110101111101001111111000110011000100100001001000011010011000111111000011111110010101011011010011111100111111011011010111101101011110 3f8cc4848698fc3f956d3f3f6d7d3f8cc4848698fc3f956d3f3f6d7b5e
EUC-JP ?呼ф傲?瀕??m}?呼ф傲?瀕??m{^ 0011111110111000110001101010011111100110110100001111111000111111110010011100111000111111001111110110110101111101001111111011100011000110101001111110011011010000111111100011111111001001110011100011111100111111011011010111101101011110 3fb8c6a7e6d0fe3fc9ce3f3f6d7d3fb8c6a7e6d0fe3fc9ce3f3f6d7b5e
UTF-8 뤿呼ф傲혧瀕렧센m}뤿呼ф傲혧瀕렧센m{^ 111010111010010010111111111001011001000110111100110100011000010011100101100000101011001011101101100110001010011111100111100000001001010111101011101000001010011111101100100001001011110001101101011111011110101110100100101111111110010110010001101111001101000110000100111001011000001010110010111011011001100010100111111001111000000010010101111010111010000010100111111011001000010010111100011011010111101101011110 eba4bfe591bcd184e582b2ed98a7e78095eba0a7ec84bc6d7deba4bfe591bcd184e582b2ed98a7e78095eba0a7ec84bc6d7b5e
UHC 뤿呼ф傲혧瀕렧센m}뤿呼ф傲혧瀕렧센m{^ 10001111111010111111101110111100101011001110011011100111111011001100001010001111110111101011010110001110101101101011110010111110011011010111110110001111111010111111101110111100101011001110011011100111111011001100001010001111110111101011010110001110101101101011110010111110011011010111101101011110 8febfbbcace6e7ecc28fdeb58eb6bcbe6d7d8febfbbcace6e7ecc28fdeb58eb6bcbe6d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)