Which bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???唯??諭??孃る?悠??淞る?? 0011111100111111001111111001011101000010001111110011111110010111010000000011111100111111100110110110111110000010111010010011111110010111010010010011111100111111100111111100001010000010111010010011111100111111 3f3f3f97423f3f97403f3f9b6f82e93f97493f3f9fc282e93f3f
EUC-JP ???唯??諭??孃る?悠??淞る?孼 00111111001111110011111111001101101000110011111100111111110011011010000100111111001111111101010111010000101001001110101100111111110011011010101000111111001111111101111011000100101001001110101100111111100011111011101011000011 3f3f3fcda33f3fcda13f3fd5d0a4eb3fcdaa3f3fdec4a4eb3f8fbac3
UTF-8 嶺뚢돦唯쎽튃諭꾠룋孃る씮悠띄춯淞る닔孼 111011111010011010101011111010111001101010100010111010111000111110100110111001011001010010101111111011001000111010111101111011011000101010000011111010001010101110101101111010101011111010100000111010111010001110001011111001011010110110000011111000111000001010001011111011001001010010101110111001101000001010100000111010111001110110000100111011001011011010101111111001101011011110011110111000111000001010001011111010111000101110010100111001011010110110111100 efa6abeb9aa2eb8fa6e594afec8ebded8a83e8abadeabea0eba38be5ad83e3828bec94aee682a0eb9d84ecb6afe6b79ee3828beb8b94e5adbc
UHC 嶺뚢돦唯쎽튃諭꾠룋孃る씮悠띄춯淞る닔孼 1110011110101101100011001110001010001001101010101110101011100110100110111110010010111001100110011110101110110001100001001110001110001111100010101110010110111110101010101110101110011101101111111110101011101101101101101110011110101101100011001110000111100111101010101110101110001000100110001110010111101101 e7ad8ce289aaeae69be4b999ebb184e38f8ae5beaaeb9dbfeaedb6e7ad8ce1e7aaeb8898e5ed

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
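
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_in_charsets` and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_in_charsets("る"):
        print(label, bits, hexstr)
```

For the single character る, this sketch yields 82e9 for SJIS-WIN, a4eb for EUC-JP, and e3828b for UTF-8, the same byte sequences that appear for that character in the table rows above.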