To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?霽?趙陌?豆?兢?矜?虞??????? 1110001101110001001111111110100011000111001111111110011011100010111010001001100100111111100100111010010000111111100110010101110100111111111000011110000000111111100010111111000100111111001111110011111100111111001111110011111100111111 e3713fe8c73fe6e2e8993f93a43f995d3fe1e03f8bf13f3f3f3f3f3f3f
EUC-JP 縡?霽?趙陌?豆?兢?矜芷虞?????塼? 111001011101001000111111111100001100100100111111111011001110010011101111111110010011111111000110101001100011111111010001101111100011111111100010111000101000111111010111110010011011011011110011001111110011111100111111001111110011111110001111101110001011100100111111 e5d23ff0c93fece4eff93fc6a63fd1be3fe2e28fd7c9b6f33f3f3f3f3f8fb8b93f
UTF-8 縡렕霽렢趙陌렍豆렚兢렚矜芷虞렧柳양렏렕塼렦 111001111011100010100001111010111010000010010101111010011001110010111101111010111010000010100010111010001011011010011001111010011001100110001100111010111010000010001101111010001011000110000110111010111010000010011010111001011000010110100010111010111010000010011010111001111001111110011100111010001000101010110111111010001001100110011110111010111010000010100111111011111010011110001001111011001001011010010001111010111010000010001111111010111010000010010101111001011010000110111100111010111010000010100110 e7b8a1eba095e99cbdeba0a2e8b699e9998ceba08de8b186eba09ae585a2eba09ae79f9ce88ab7e8999eeba0a7efa789ec9691eba08feba095e5a1bceba0a6
UHC 縡렕霽렢趙陌렍豆렚兢렚矜芷虞렧柳양렏렕塼렦 111011101010110110001110101010101111000010111000100011101011001111110000111000011101100011101000100011101010001111010100111001111000111010101101110100001110011110001110101011011101000011101000111100101011101011101001111001011000111010110110111010101111011110111110111001111000111010100101100011101010101011101110111101001000111010110101 eead8eaaf0b88eb3f0e1d8e88ea3d4e78eadd0e78eadd0e8f2bae9e58eb6eaf7bee78ea58eaaeef48eb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)