To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???杖⑨?絶???①?章??絶?????B 00111111001111110011111110001111111100011000011101001000001111111001000011100010001111110011111100111111100001110100000000111111100011111100110100111111001111111001000011100010001111110011111100111111001111110011111101000010 3f3f3f8ff187483f90e23f3f3f87403f8fcd3f3f90e23f3f3f3f3f42
EUC-JP ???杖??絶?????章??絶??琰??B 00111111001111110011111110111110111100110011111100111111110000001110010000111111001111110011111100111111001111111011111011001111001111110011111111000000111001000011111100111111100011111100110010110100001111110011111101000010 3f3f3fbef33f3fc0e43f3f3f3f3fbecf3f3fc0e43f3f8fccb43f3f42
UTF-8 怜뷁뱱杖⑨풅絶쏁왃狀①컮章쀮뒯絶랃풘琰띨맖B 11101111101001101010110011101011101101111000000111101011101100011011000111100110100111011001011011100010100100011010100011101101100100101000010111100111101101011011011011101100100011111000000111101100100110011000001111101111101001111011101011100010100100011010000011101100101110111010111011100111101010111010000011101100100000001010111011101011100100101010111111100111101101011011011011101011100111101000001111101101100100101001100011100111100100001011000011101011100111011010100011101011101001111001011001000010 efa6acebb781ebb1b1e69d96e291a8ed9285e7b5b6ec8f81ec9983efa7bae291a0ecbbaee7aba0ec80aeeb92afe7b5b6eb9e83ed9298e790b0eb9da8eba79642
UHC 怜뷁뱱杖⑨풅絶쏁왃狀①컮章쀮뒯絶랃풘琰띨맖B 11100111101100001001010011101110100100111001011111101101111010001010100011101111101111101000110111101111101111101001101111100111100111101011011011101101111011101010100011100111101100001001010011101101111100011001011111101110100010101010100011101111101111101000110111101111101111101001101111100110111111001011011011101110100100001010100001000010 e7b094ee9397ede8a8efbe8defbe9be79eb6edeea8e7b094edf197ee8aa8efbe8defbe9be6fcb6ee90a842

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
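
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the tool's own implementation. The codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration, and Python's codecs may map some characters differently from the converter. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the input `B`, every charset listed yields the single byte `42` (binary `01000010`), which matches the final byte of each row in the table.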