To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 沃??弱?ぐ癌??n}沃??弱?ぐ癌??n{^ 10010111100000000011111100111111100011101110001100111111100000101010111010001010111000000011111100111111011011100111110110010111100000000011111100111111100011101110001100111111100000101010111010001010111000000011111100111111011011100111101101011110 97803f3f8ee33f82ae8ae03f3f6e7d97803f3f8ee33f82ae8ae03f3f6e7b5e
EUC-JP 沃??弱?ぐ癌??n}沃??弱?ぐ癌??n{^ 11001101111000000011111100111111101111001110010100111111101001001011000010110100111000100011111100111111011011100111110111001101111000000011111100111111101111001110010100111111101001001011000010110100111000100011111100111111011011100111101101011110 cde03f3fbce53fa4b0b4e23f3f6e7dcde03f3fbce53fa4b0b4e23f3f6e7b5e
UTF-8 沃겼겢弱딂ぐ癌닷컟n}沃겼겢弱딂ぐ癌닷컟n{^ 1110011010110010100000111110101010110010101111001110101010110010101000101110010110111100101100011110101110010100100000101110001110000001100100001110011110011001100011001110101110001011101101111110110010111011100111110110111001111101111001101011001010000011111010101011001010111100111010101011001010100010111001011011110010110001111010111001010010000010111000111000000110010000111001111001100110001100111010111000101110110111111011001011101110011111011011100111101101011110 e6b283eab2bceab2a2e5bcb1eb9482e38190e7998ceb8bb7ecbb9f6e7de6b283eab2bceab2a2e5bcb1eb9482e38190e7998ceb8bb7ecbb9f6e7b5e
UHC 沃겼겢弱딂ぐ癌닷컟n}沃겼겢弱딂ぐ癌닷컟n{^ 1110100010101010101100001110010110000001101101001110010110110000100010101110100010101010101100001110010011011111101101001110010110110000100010100110111001111101111010001010101010110000111001011000000110110100111001011011000010001010111010001010101010110000111001001101111110110100111001011011000010001010011011100111101101011110 e8aab0e581b4e5b08ae8aab0e4dfb4e5b08a6e7de8aab0e581b4e5b08ae8aab0e4dfb4e5b08a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)