To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ?る?艤??純??}?る?艤??純??{^ 001111111000001011101001001111111110010001111110001111110011111110001111100000110011111100111111011111010011111110000010111010010011111111100100011111100011111100111111100011111000001100111111001111110111101101011110 3f82e93fe47e3f3f8f833f3f7d3f82e93fe47e3f3f8f833f3f7b5e
EUC-JP ?る?艤??純??}?る?艤??純??{^ 001111111010010011101011001111111110011111011111001111110011111110111101111000110011111100111111011111010011111110100100111010110011111111100111110111110011111100111111101111011110001100111111001111110111101101011110 3fa4eb3fe7df3f3fbde33f3f7d3fa4eb3fe7df3f3fbde33f3f7b5e
UTF-8 閭る틹艤섓쭓純볥뭄}閭る틹艤섓쭓純볥뭄{^ 111011111010011010000110111000111000001010001011111011011000101110111001111010001000100110100100111011001000010010010011111011001010110110010011111001111011010010010100111010111011001110100101111010111010110110000100011111011110111110100110100001101110001110000010100010111110110110001011101110011110100010001001101001001110110010000100100100111110110010101101100100111110011110110100100101001110101110110011101001011110101110101101100001000111101101011110 efa686e3828bed8bb9e889a4ec8493ecad93e7b494ebb3a5ebad847defa686e3828bed8bb9e889a4ec8493ecad93e7b494ebb3a5ebad847b5e
UHC 閭る틹艤섓쭓純볥뭄}閭る틹艤섓쭓純볥뭄{^ 111001101010110110101010111010111011101010011111111010111111101010011000111011111010011110001011111000101110110110010011111010111011100110110011011111011110011010101101101010101110101110111010100111111110101111111010100110001110111110100111100010111110001011101101100100111110101110111001101100110111101101011110 e6adaaebba9febfa98efa78be2ed93ebb9b37de6adaaebba9febfa98efa78be2ed93ebb9b37b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)