To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦?????怨??魚??揖??轅??烏k? 10010110100100100011111100111111001111110011111100111111100010011000010100111111001111111000101110011011001111110011111110010111010010110011111100111111111001110111011000111111001111111000100101000111100000101000101100111111 96923f3f3f3f3f89853f3f8b9b3f3f974b3f3fe7763f3f8947828b3f
EUC-JP 亦?????怨??魚??揖??轅??烏k? 11001011111100100011111100111111001111110011111100111111101100011110010100111111001111111011010111111011001111110011111111001101101011000011111100111111111011011101011100111111001111111011000110101000101000111110101100111111 cbf23f3f3f3f3fb1e53f3fb5fb3f3fcdac3f3fedd73f3fb1a8a3eb3f
UTF-8 亦껁끆璘뺟윢怨뺣젲魚잙쉴揖쇔젆轅⑸짎烏k냄 111001001011101010100110111010101011101110000001111010111000000110000110111011111010011110101111111010111011101010011111111011001001110010100010111001101000000010101000111010111011101010100011111011001010000010110010111010011010110110011010111011001001111010011001111011001000100110110100111001101000111110010110111011001000011110010100111011001010000010000110111010001011110110000101111000101001000110111000111011001010011110001110111001111000001110001111111011111011110110001011111010111000001110000100 e4baa6eabb81eb8186efa7afebba9fec9ca2e680a8ebbaa3eca0b2e9ad9aec9e99ec89b4e68f96ec8794eca086e8bd85e291b8eca78ee7838fefbd8beb8384
UHC 亦껁끆璘뺟윢怨뺣젲魚잙쉴揖쇔젆轅⑸짎烏k냄 111001101011001010000011111000111000010110111010111011001101111010010101111001111001111110100011111010101011001110010101111010111010000010100110111001011110000010011111111010111011110110101111111010111110011110111100111001011010000010001001111010101011111110101001111010111010001110011010111010001010000110100011111010111011001110111111 e6b283e385baecde95e79fa3eab395eba0a6e5e09febbdafebe7bce5a089eabfa9eba39ae8a1a3ebb3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)