To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??旭?垣?伊頭?豆?矜?豆 001111110011111110001000101011100011111110001010010111110011111110001000110010011001001110101010001111111001001110100100001111111110000111100000001111111001001110100100 3f3f88ae3f8a5f3f88c993aa3f93a43fe1e03f93a4
EUC-JP ??旭?垣?伊頭?豆?矜?豆 001111110011111110110000101100000011111110110011110000000011111110110000110010111100011010101100001111111100011010100110001111111110001011100010001111111100011010100110 3f3fb0b03fb3c03fb0cbc6ac3fc6a63fe2e23fc6a6
UTF-8 亐렕旭렔垣렖伊頭렧豆렚矜렍豆 111001001011101010010000111010111010000010010101111001101001011110101101111010111010000010010100111001011001111010100011111010111010000010010110111001001011110010001010111010011010000010101101111010111010000010100111111010001011000110000110111010111010000010011010111001111001111110011100111010111010000010001101111010001011000110000110 e4ba90eba095e697adeba094e59ea3eba096e4bc8ae9a0adeba0a7e8b186eba09ae79f9ceba08de8b186
UHC 亐렕旭렔垣렖伊頭렧豆렚矜렍豆 11101010101001111000111010101010111010011110111110001110101010011110101010101111100011101010101111101100101001011101010011101001100011101011011011010100111001111000111010101101110100001110100010001110101000111101010011100111 eaa78eaae9ef8ea9eaaf8eabeca5d4e98eb6d4e78eadd0e88ea3d4e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)