To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???厭?????抑???ョ?擬??鵝??^ 001111110011111100111111100010010111110100111111001111110011111100111111001111111001011101111101001111110011111100111111100000111000011100111111100010110101101100111111001111111110101001000000001111110011111101011110 3f3f3f897d3f3f3f3f3f977d3f3f3f83873f8b5b3f3fea403f3f5e
EUC-JP ???厭?????抑???ョ?擬??鵝??^ 001111110011111100111111101100011101111000111111001111110011111100111111001111111100110111011110001111110011111100111111101001011110011100111111101101011011110000111111001111111111001110100001001111110011111101011110 3f3f3fb1de3f3f3f3f3fcdde3f3f3fa5e73fb5bc3f3ff3a13f3f5e
UTF-8 玲곷젷厭묒뼏杻듕퐥抑븀퓖溜ョ뼇擬쀫뼲鵝롦녂^ 11101111101001101010110111101010101100111011011111101100101000001011011111100101100011101010110111101011101011001001001011101011101111001000111111101111101001111000100011101011100100111001010111101101100100001010010111100110100010101001000111101011101110001000000011101101100100111001011011101111101001111000101111100011100000111010011111101011101111001000011111100110100100111010110011101100100000001010101111101011101111001011001011101001101101011001110111101011101000011010011011101011100001011000001001011110 efa6adeab3b7eca0b7e58eadebac92ebbc8fefa788eb9395ed90a5e68a91ebb880ed9396efa78be383a7ebbc87e693acec80abebbcb2e9b59deba1a6eb85825e
UHC 玲곷젷厭묒뼏杻듕퐥抑븀퓖溜ョ뼇擬쀫뼲鵝롦녂^ 11100111101111111000000111101011101000001010101111100110111101001001000111101100100101101001011111101010111101001011010111100100101111011000111011100101111001001011101011100111101111111000000111101010111111101010101111100111100101101001000111101011111101001001011111101011100101101011010111100100101111011000111011100110100001101011101001011110 e7bf81eba0abe6f491ec9697eaf4b5e4bd8ee5e4bae7bf81eafeabe79691ebf497eb96b5e4bd8ee686ba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)