To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????}??????????{^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101111101001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 趙???彫豆??忽?}趙???彫豆??忽?{^ 11100110111000100011111100111111001111111001001010100100100100111010010000111111001111111000110110011010001111110111110111100110111000100011111100111111001111111001001010100100100100111010010000111111001111111000110110011010001111110111101101011110 e6e23f3f3f92a493a43f3f8d9a3f7de6e23f3f3f92a493a43f3f8d9a3f7b5e
EUC-JP 趙???彫豆??忽?}趙???彫豆??忽?{^ 11101100111001000011111100111111001111111100010010100110110001101010011000111111001111111011100111111010001111110111110111101100111001000011111100111111001111111100010010100110110001101010011000111111001111111011100111111010001111110111101101011110 ece43f3f3fc4a6c6a63f3fb9fa3f7dece43f3f3fc4a6c6a63f3fb9fa3f7b5e
UTF-8 趙쿰렰렦彫豆맑뤚忽줻}趙쿰렰렦彫豆맑뤚忽줻{^ 111010001011011010011001111011001011111110110000111010111010000010110000111010111010000010100110111001011011110110101011111010001011000110000110111010111010011110010001111010111010010010011010111001011011111110111101111011001010010010111011011111011110100010110110100110011110110010111111101100001110101110100000101100001110101110100000101001101110010110111101101010111110100010110001100001101110101110100111100100011110101110100100100110101110010110111111101111011110110010100100101110110111101101011110 e8b699ecbfb0eba0b0eba0a6e5bdabe8b186eba791eba49ae5bfbdeca4bb7de8b699ecbfb0eba0b0eba0a6e5bdabe8b186eba791eba49ae5bfbdeca4bb7b5e
UHC 趙쿰렰렦彫豆맑뤚忽줻}趙쿰렰렦彫豆맑뤚忽줻{^ 11110000111000011100010011110001100011101011110110001110101101011111000011000001110101001110011110111000101111001000111111001001111110111110110010100010011011100111110111110000111000011100010011110001100011101011110110001110101101011111000011000001110101001110011110111000101111001000111111001001111110111110110010100010011011100111101101011110 f0e1c4f18ebd8eb5f0c1d4e7b8bc8fc9fbeca26e7df0e1c4f18ebd8eb5f0c1d4e7b8bc8fc9fbeca26e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)