To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???竊??擬??壓?????淫??吾??B 001111110011111100111111111000101000011000111111001111111000101101011011001111110011111110011010110110000011111100111111001111110011111100111111100010001111101000111111001111111000110011100001001111110011111101000010 3f3f3fe2863f3f8b5b3f3f9ad83f3f3f3f3f88fa3f3f8ce13f3f42
EUC-JP ???竊??擬??壓??靷??淫??吾??B 0011111100111111001111111110001111100110001111110011111110110101101111000011111100111111110101001101101000111111001111111000111111100111101111010011111100111111101100001111110000111111001111111011100011100011001111110011111101000010 3f3f3fe3e63f3fb5bc3f3fd4da3f3f8fe7bd3f3fb0fc3f3fb8e33f3f42
UTF-8 捻뀁뮆竊섉윓擬띿젂壓믩뗄靷쀦쾬淫륁쪚吾명깈B 11101111101001101010010011101011100000001000000111101011101011101000011011100111101010111000101011101100100001001000100111101100100111001001001111100110100100111010110011101011100111011011111111101100101000001000001011100101101000111001001111101011101011111010100111101011100101111000010011101001100111011011011111101100100000001010011011101100101111101010110011100110101101111010101111101011101001011000000111101100101010101001101011100101100100001011111011101011101010101000010111101010101110011000100001000010 efa6a4eb8081ebae86e7ab8aec8489ec9c93e693aceb9dbfeca082e5a393ebafa9eb9784e99db7ec80a6ecbeace6b7abeba581ecaa9ae590beebaa85eab98842
UHC 捻뀁뮆竊섉윓擬띿젂壓믩뗄靷쀦쾬淫륁쪚吾명깈B 11100110111101111011001011101100100100101001010111101111101111001001100011100110100111111001101011101011111101001000110111101100101000001000011011100100111000101001001011101011101101101011111111101100111001101001011111100110101100101000001111101011111000101000111111101100101001011001001111100111111011101011100011101101100000111000011101000010 e6f7b2ec9295efbc98e69f9aebf48deca086e4e292ebb6bfece697e6b283ebe28feca593e7eeb8ed838742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)