To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?醍??伊豆?兢????制 1110001101110001001111111001000111100111001111110011111110001000110010011001001110100100001111111001100101011101001111110011111100111111001111111001000010100111 e3713f91e73f3f88c993a43f995d3f3f3f3f90a7
EUC-JP 縡?醍?汶伊豆?兢????制 11100101110100100011111111000010111010010011111110001111110001101110010110110000110010111100011010100110001111111101000110111110001111110011111100111111001111111100000010101001 e5d23fc2e93f8fc6e5b0cbc6a63fd1be3f3f3f3fc0a9
UTF-8 縡렕醍닺汶伊豆렚兢렟닿렟렩制 111001111011100010100001111010111010000010010101111010011000011010001101111010111000101110111010111001101011000110110110111001001011110010001010111010001011000110000110111010111010000010011010111001011000010110100010111010111010000010011111111010111000101110111111111010111010000010011111111010111010000010101001111001011000100010110110 e7b8a1eba095e9868deb8bbae6b1b6e4bc8ae8b186eba09ae585a2eba09feb8bbfeba09feba0a9e588b6
UHC 縡렕醍닺汶伊豆렚兢렟닿렟렩制 11101110101011011000111010101010111100001011010110110100111010001101101010100001111011001010010111010100111001111000111010101101110100001110011110001110101100001011010011101010100011101011000010001110101101111111000010100100 eead8eaaf0b5b4e8daa1eca5d4e78eadd0e78eb0b4ea8eb08eb7f0a4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)