Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鍾テハセミス爾v鍾テハセミス爾vB 1000111111011111110000111100101011110001100011101011111011010000111100011000111010111101100011101010001001110110100011111101111111000011110010101111000110001110101111101101000011110001100011101011110110001110101000100111011001000010 8fdfc3caf18ebed0f18ebd8ea2768fdfc3caf18ebed0f18ebd8ea27642
EUC-JP 鍾テハ?セミ?ス爾v鍾テハ?セミ?ス爾vB 1011111011100001100011101100001110001110110010100011111110001110101111101000111011010000001111111000111010111101101111001010010001110110101111101110000110001110110000111000111011001010001111111000111010111110100011101101000000111111100011101011110110111100101001000111011001000010 bee18ec38eca3f8ebe8ed03f8ebdbca476bee18ec38eca3f8ebe8ed03f8ebdbca47642
UTF-8 鍾テハセミス爾v鍾テハセミス爾vB 111010011000110110111110111011111011111010000011111011111011111010001010111011101000010010001001111011111011110110111110111011111011111010010000111011101000010010001001111011111011110110111101111001111000100010111110011101101110100110001101101111101110111110111110100000111110111110111110100010101110111010000100100010011110111110111101101111101110111110111110100100001110111010000100100010011110111110111101101111011110011110001000101111100111011001000010 e98dbeefbe83efbe8aee8489efbdbeefbe90ee8489efbdbde788be76e98dbeefbe83efbe8aee8489efbdbeefbe90ee8489efbdbde788be7642
UHC 鍾???????爾v鍾???????爾vB 11110001101000110011111100111111001111110011111100111111001111110011111111101100101100110111011011110001101000110011111100111111001111110011111100111111001111110011111111101100101100110111011001000010 f1a33f3f3f3f3f3f3fecb376f1a33f3f3f3f3f3f3fecb37642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
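
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the page's actual implementation: the `CODECS` mapping and the `encode_table` helper are hypothetical names, the choice of `cp949` for UHC is an assumption, and `errors="replace"` is assumed because unencodable characters appear in the table as `?` (0x3f).

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# "cp932" follows the note above (SJIS-Win = CP932); "cp949" for UHC is an assumption.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # Characters the charset cannot represent are replaced with "?" (0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("vB").items():
        print(label, bits, hex_string)
```

For pure ASCII input such as `vB`, every charset listed here yields the same bytes (`7642`); differences only appear for characters outside ASCII.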