To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??宋??嚥▲?猷??擬??億 111000011001111100111111001111111000101101011000001111110011111110010001011101100011111100111111100110101000101110000001101000110011111110010111010100010011111100111111100010110101101100111111001111111000100110101101 e19f3f3f8b583f3f91763f3f9a8b81a33f97513f3f8b5b3f3f89ad
EUC-JP 癲??宜??宋??嚥▲?猷??擬??億 111000101010000100111111001111111011010110111001001111110011111111000001110101110011111100111111110100111110101110100010101001010011111111001101101100100011111100111111101101011011110000111111001111111011001010101111 e2a13f3fb5b93f3fc1d73f3fd3eba2a53fcdb23f3fb5bc3f3fb2af
UTF-8 癲덈챶宜방뉘宋볥젧嚥▲룗猷녽뒽擬쒕깹億 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111011000010101001111010111000100110011000111001011010111010001011111010111011001110100101111011001010000010100111111001011001101010100101111000101001011010110010111010111010001110010111111001111000110010110111111010111000010110111101111010111001001010111101111001101001001110101100111011001001001010010101111010101011100110111001111001011000010010000100 e799b2eb8d88ecb1b6e5ae9cebb0a9eb8998e5ae8bebb3a5eca0a7e59aa5e296b2eba397e78cb7eb85bdeb92bde693acec9295eab9b9e58484
UHC 癲덈챶宜방뉘宋볥젧嚥▲룗猷녽뒽擬쒕깹億 1110111110100110100010001110101110101010100000111110101111110001101110011110011010110100101101011110000111100100100100111110101110100000100111111110011010111111101000011110001110001111100100111110101110100011100001101110100110001010101100111110101111110100100111001110101110110010101000011110010111100010 efa688ebaa83ebf1b9e6b4b5e1e493eba09fe6bfa1e38f93eba386e98ab3ebf49cebb2a1e5e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)