To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???唯??諭??筌??茵??淞る?俉?? 00111111001111110011111110010111010000100011111100111111100101110100000000111111001111111110001010100011001111110011111111100100100111110011111100111111100111111100001010000010111010010011111111111010011000010011111100111111 3f3f3f97423f3f97403f3fe2a33f3fe49f3f3f9fc282e93ffa613f3f
EUC-JP ???唯??諭??筌??茵??淞る?俉?? 0011111100111111001111111100110110100011001111110011111111001101101000010011111100111111111001001010010100111111001111111110100010100001001111110011111111011110110001001010010011101011001111111000111110110001101110110011111100111111 3f3f3fcda33f3fcda13f3fe4a53f3fe8a13f3fdec4a4eb3f8fb1bb3f3f
UTF-8 嶺뚢돦唯쎿룚諭꾠룋筌믨퀗茵먪춯淞る닔俉묎늉 111011111010011010101011111010111001101010100010111010111000111110100110111001011001010010101111111011001000111010111111111010111010001110011010111010001010101110101101111010101011111010100000111010111010001110001011111001111010110110001100111010111010111110101000111011011000000010010111111010001000110010110101111010111010100010101010111011001011011010101111111001101011011110011110111000111000001010001011111010111000101110010100111001001011111110001001111010111010110010001110111010111000101010001001 efa6abeb9aa2eb8fa6e594afec8ebfeba39ae8abadeabea0eba38be7ad8cebafa8ed8097e88cb5eba8aaecb6afe6b79ee3828beb8b94e4bf89ebac8eeb8a89
UHC 嶺뚢돦唯쎿룚諭꾠룋筌믨퀗茵먪춯淞る닔俉묎늉 111001111010110110001100111000101000100110101010111010101110011010011011111001101000111110010110111010111011000110000100111000111000111110001010111011111010011110010010111010101011001110001100111011001110000010010000111001111010110110001100111000011110011110101010111010111000100010011000111001111110101110010001111010101011010010111111 e7ad8ce289aaeae69be68f96ebb184e38f8aefa792eab38cece090e7ad8ce1e7aaeb8898e7eb91eab4bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)