To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?將?妖?製?除??棒????趙驀??? 1001001010100010001111111001101110010010001111111001011101100100001111111001000010111011001111111000111110011100001111110011111110010110010111110011111100111111001111110011111111100110111000101110100101111101001111110011111100111111 92a23f9b923f97643f90bb3f8f9c3f3f965f3f3f3f3fe6e2e97d3f3f3f
EUC-JP 弔?將?妖?製?除??棒????趙驀??塼 11000100101001000011111111010101111100100011111111001101110001010011111111000000101111010011111110111101111111000011111100111111110010111100000000111111001111110011111100111111111011001110010011110001110111100011111100111111100011111011100010111001 c4a43fd5f23fcdc53fc0bd3fbdfc3f3fcbc03f3f3f3fece4f1de3f3f8fb8b9
UTF-8 弔렟將렚妖렢製렩除곁렠棒렟렩履렰趙驀렖렕塼 111001011011110010010100111010111010000010011111111001011011000010000111111010111010000010011010111001011010011010010110111010111010000010100010111010001010001110111101111010111010000010101001111010011001100110100100111010101011001110000001111010111010000010100000111001101010001110010010111010111010000010011111111010111010000010101001111011111010011110011111111010111010000010110000111010001011011010011001111010011010100110000000111010111010000010010110111010111010000010010101111001011010000110111100 e5bc94eba09fe5b087eba09ae5a696eba0a2e8a3bdeba0a9e999a4eab381eba0a0e6a392eba09feba0a9efa79feba0b0e8b699e9a980eba096eba095e5a1bc
UHC 弔렟將렚妖렢製렩除곁렠棒렟렩履렰趙驀렖렕塼 111100001100000010001110101100001110110111100010100011101010110111101000111011011000111010110011111100001011001010001110101101111111000010110110101100001110011110001110101100011101110011101010100011101011000010001110101101111110110010101010100011101011110111110000111000011101100011101001100011101010101110001110101010101110111011110100 f0c08eb0ede28eade8ed8eb3f0b28eb7f0b6b0e78eb1dcea8eb08eb7ecaa8ebdf0e1d8e98eab8eaaeef4

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
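
The page does not say how each row is computed, so the following is only a minimal sketch in Python of one way to build a comparable table. The codec names in CODECS (for example "cp932" for SJIS-WIN and "cp949" for UHC) and the helper names encode_row and convert are assumptions made for illustration; they are not taken from the page. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour visible in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed characters, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first, with no separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Build one row per charset label for the given input string."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in convert("弔").items():
        print(label, shown, bits, hex_string)
```

Python's codecs may differ from the page's converter for some characters (vendor extensions in particular), so the output is best treated as a cross-check rather than a reproduction of the table.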