To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?弔?虞?製???窪狡??梯???梯?? 1001001010100010001111111001001010100010001111111000101111110001001111111001000010111011001111110011111100111111100011000100010111100000110000100011111100111111100100101111001000111111001111110011111110010010111100100011111100111111 92a23f92a23f8bf13f90bb3f3f3f8c45e0c23f3f92f23f3f3f92f23f3f
EUC-JP 弔?弔?虞?製?勖?窪狡??梯???梯?釪 110001001010010000111111110001001010010000111111101101101111001100111111110000001011110100111111100011111011001111101101001111111011011110100110111000001100010000111111001111111100010011110100001111110011111100111111110001001111010000111111100011111110001110101101 c4a43fc4a43fb6f33fc0bd3f8fb3ed3fb7a6e0c43f3fc4f43f3f3fc4f43f8fe3ad
UTF-8 弔렟弔렟虞렧製렩勖렢窪狡렟렩梯렟罹렗梯렟釪 111001011011110010010100111010111010000010011111111001011011110010010100111010111010000010011111111010001001100110011110111010111010000010100111111010001010001110111101111010111010000010101001111001011000101110010110111010111010000010100010111001111010101010101010111001111000101110100001111010111010000010011111111010111010000010101001111001101010001010101111111010111010000010011111111011111010011110100110111010111010000010010111111001101010001010101111111010111010000010011111111010011000011110101010 e5bc94eba09fe5bc94eba09fe8999eeba0a7e8a3bdeba0a9e58b96eba0a2e7aaaae78ba1eba09feba0a9e6a2afeba09fefa7a6eba097e6a2afeba09fe987aa
UHC 弔렟弔렟虞렧製렩勖렢窪狡렟렩梯렟罹렗梯렟釪 111100001100000010001110101100001111000011000000100011101011000011101001111001011000111010110110111100001011001010001110101101111110100111101101100011101011001111101000110000011100111011101010100011101011000010001110101101111111000010101100100011101011000011101100101110101000111010101100111100001010110010001110101100001110100111101001 f0c08eb0f0c08eb0e9e58eb6f0b28eb7e9ed8eb3e8c1ceea8eb08eb7f0ac8eb0ecba8eacf0ac8eb0e9e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)