Which bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 重???淫臀???駿????淫臀???駿? 10001111011001000011111100111111001111111000100011111010111001000101110000111111001111110011111110001111011110000011111100111111001111110011111110001000111110101110010001011100001111110011111100111111100011110111100000111111 8f643f3f3f88fae45c3f3f3f8f783f3f3f3f88fae45c3f3f3f8f783f
EUC-JP 重?勖?淫臀?勖?駿??勖?淫臀?勖?駿? 101111011100010100111111100011111011001111101101001111111011000011111100111001111011110100111111100011111011001111101101001111111011110111011001001111110011111110001111101100111110110100111111101100001111110011100111101111010011111110001111101100111110110100111111101111011101100100111111 bdc53f8fb3ed3fb0fce7bd3f8fb3ed3fbdd93f3f8fb3ed3fb0fce7bd3f8fb3ed3fbdd93f
UTF-8 重렖勖렓淫臀릎勖렓駿띕릎勖렓淫臀릎勖렓駿동 111010011000011110001101111010111010000010010110111001011000101110010110111010111010000010010011111001101011011110101011111010001000011110000000111010111010011010001110111001011000101110010110111010111010000010010011111010011010011110111111111010111001110110010101111010111010011010001110111001011000101110010110111010111010000010010011111001101011011110101011111010001000011110000000111010111010011010001110111001011000101110010110111010111010000010010011111010011010011110111111111010111000111110011001 e9878deba096e58b96eba093e6b7abe88780eba68ee58b96eba093e9a7bfeb9d95eba68ee58b96eba093e6b7abe88780eba68ee58b96eba093e9a7bfeb8f99
UHC 重렖勖렓淫臀릎勖렓駿띕릎勖렓淫臀릎勖렓駿동 111100011110110010001110101010111110100111101101100011101010100011101011111000101101010011101011101110001010110111101001111011011000111010101000111100011110011110110110111010111011100010101101111010011110110110001110101010001110101111100010110101001110101110111000101011011110100111101101100011101010100011110001111001111011010110111111 f1ec8eabe9ed8ea8ebe2d4ebb8ade9ed8ea8f1e7b6ebb8ade9ed8ea8ebe2d4ebb8ade9ed8ea8f1e7b5bf

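SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row.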
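
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption. Python's codecs may differ from this page's converter for some vendor-specific characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    # Decode the bytes again to show what survives the round trip.
    return data.decode(codec), bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexa) in convert("重").items():
        print(name, shown, bits, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.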