To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蠢?儲??蠢?爰???蠢?儲??蠢?爰?? 1110010110111111001111111001011011010111001111110011111111100101101111110011111111100000101001110011111100111111001111111110010110111111001111111001011011010111001111110011111111100101101111110011111111100000101001110011111100111111 e5bf3f96d73f3fe5bf3fe0a73f3f3fe5bf3f96d73f3fe5bf3fe0a73f3f
EUC-JP 蠢?儲??蠢?爰???蠢?儲??蠢?爰?? 1110101011000001001111111100110011011001001111110011111111101010110000010011111111100000101010010011111100111111001111111110101011000001001111111100110011011001001111110011111111101010110000010011111111100000101010010011111100111111 eac13fccd93f3feac13fe0a93f3f3feac13fccd93f3feac13fe0a93f3f
UTF-8 蠢렎儲쾌㎭蠢렎爰렪綎옵蠢렎儲쾌㎭蠢렎爰렪綎 111010001010000010100010111010111010000010001110111001011000010010110010111011001011111010001100111000111000111010101101111010001010000010100010111010111010000010001110111001111000100010110000111010111010000010101010111001111011011010001110111011001001100010110101111010001010000010100010111010111010000010001110111001011000010010110010111011001011111010001100111000111000111010101101111010001010000010100010111010111010000010001110111001111000100010110000111010111010000010101010111001111011011010001110 e8a0a2eba08ee584b2ecbe8ce38eade8a0a2eba08ee788b0eba0aae7b68eec98b5e8a0a2eba08ee584b2ecbe8ce38eade8a0a2eba08ee788b0eba0aae7b68e
UHC 蠢렎儲쾌㎭蠢렎爰렪綎옵蠢렎儲쾌㎭蠢렎爰렪綎 111100011110001110001110101001001110111010111001110001001110100010100111111000011111000111100011100011101010010011101010101110101000111010111000111011111111001010111111110010011111000111100011100011101010010011101110101110011100010011101000101001111110000111110001111000111000111010100100111010101011101010001110101110001110111111110010 f1e38ea4eeb9c4e8a7e1f1e38ea4eaba8eb8eff2bfc9f1e38ea4eeb9c4e8a7e1f1e38ea4eaba8eb8eff2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)