To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????{N}????????{N{^ 0011111100111111001111110011111100111111001111110011111100111111011110110100111001111101001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN 芸???移?依?{N}芸???移?依?{N{^ 1000110001111100001111110011111100111111100010001101101000111111100010001100101100111111011110110100111001111101100011000111110000111111001111110011111110001000110110100011111110001000110010110011111101111011010011100111101101011110 8c7c3f3f3f88da3f88cb3f7b4e7d8c7c3f3f3f88da3f88cb3f7b4e7b5e
EUC-JP 芸?勖?移?依?{N}芸?勖?移?依?{N{^ 101101111101110100111111100011111011001111101101001111111011000011011100001111111011000011001101001111110111101101001110011111011011011111011101001111111000111110110011111011010011111110110000110111000011111110110000110011010011111101111011010011100111101101011110 b7dd3f8fb3ed3fb0dc3fb0cd3f7b4e7db7dd3f8fb3ed3fb0dc3fb0cd3f7b4e7b5e
UTF-8 芸렑勖렢移렊依렋{N}芸렑勖렢移렊依렋{N{^ 11101000100010101011100011101011101000001001000111100101100010111001011011101011101000001010001011100111101001111011101111101011101000001000101011100100101111101001110111101011101000001000101101111011010011100111110111101000100010101011100011101011101000001001000111100101100010111001011011101011101000001010001011100111101001111011101111101011101000001000101011100100101111101001110111101011101000001000101101111011010011100111101101011110 e88ab8eba091e58b96eba0a2e7a7bbeba08ae4be9deba08b7b4e7de88ab8eba091e58b96eba0a2e7a7bbeba08ae4be9deba08b7b4e7b5e
UHC 芸렑勖렢移렊依렋{N}芸렑勖렢移렊依렋{N{^ 111010011111110110001110101001101110100111101101100011101011001111101100101110011000111010100001111010111110111010001110101000100111101101001110011111011110100111111101100011101010011011101001111011011000111010110011111011001011100110001110101000011110101111101110100011101010001001111011010011100111101101011110 e9fd8ea6e9ed8eb3ecb98ea1ebee8ea27b4e7de9fd8ea6e9ed8eb3ecb98ea1ebee8ea27b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)