What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?霽?鷹???寃???????????除 1001111111000100001111111110100011000111001111111001000111101001001111110011111100111111100110111000001100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000111110011100 9fc43fe8c73f91e93f3f3f9b833f3f3f3f3f3f3f3f3f3f3f8f9c
EUC-JP 淨?霽?鷹???寃?????勖?????除 11011110110001100011111111110000110010010011111111000010111010110011111100111111001111111101010111100011001111110011111100111111001111110011111110001111101100111110110100111111001111110011111100111111001111111011110111111100 dec63ff0c93fc2eb3f3f3fd5e33f3f3f3f3f8fb3ed3f3f3f3f3fbdfc
UTF-8 淨렠霽렢鷹꿸렟렩寃닿렣곌렟렩勖쾅렟닻렖렕除 111001101011011110101000111010111010000010100000111010011001110010111101111010111010000010100010111010011011011110111001111010101011111110111000111010111010000010011111111010111010000010101001111001011010111110000011111010111000101110111111111010111010000010100011111010101011001110001100111010111010000010011111111010111010000010101001111001011000101110010110111011001011111010000101111010111010000010011111111010111000101110111011111010111010000010010110111010111010000010010101111010011001100110100100 e6b7a8eba0a0e99cbdeba0a2e9b7b9eabfb8eba09feba0a9e5af83eb8bbfeba0a3eab38ceba09feba0a9e58b96ecbe85eba09feb8bbbeba096eba095e999a4
UHC 淨렠霽렢鷹꿸렟렩寃닿렣곌렟렩勖쾅렟닻렖렕除 111011111110010010001110101100011111000010111000100011101011001111101011111011011011001011101010100011101011000010001110101101111110101010110010101101001110101010001110101101001011000011101010100011101011000010001110101101111110100111101101110001001110011110001110101100001011010011101001100011101010101110001110101010101111000010110110 efe48eb1f0b88eb3ebedb2ea8eb08eb7eab2b4ea8eb4b0ea8eb08eb7e9edc4e78eb0b4e98eab8eaaf0b6

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
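
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_report` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes in the table.

```python
def encode_report(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary bit string, hex string) for text in each charset.

    The codec names are Python's; cp932 is assumed to stand in for SJIS-WIN
    and cp949 for UHC.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_report("除"):
        print(name, bits, hex_string)
```

Results for a given input may differ from the table if the page uses different conversion tables or a different replacement policy than Python's codecs.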