To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?脹????盲??}v?脹????盲??}vB 001111111001001010101111001111110011111100111111001111111001011011010011001111110011111101111101011101100011111110010010101011110011111100111111001111110011111110010110110100110011111100111111011111010111011001000010 3f92af3f3f3f3f96d33f3f7d763f92af3f3f3f3f96d33f3f7d7642
EUC-JP ?脹????盲??}v?脹????盲??}vB 001111111100010010110001001111110011111100111111001111111100110011010101001111110011111101111101011101100011111111000100101100010011111100111111001111110011111111001100110101010011111100111111011111010111011001000010 3fc4b13f3f3f3fccd53f3f7d763fc4b13f3f3f3fccd53f3f7d7642
UTF-8 뤋脹쭗샘어렡盲렡렚}v뤋脹쭗샘어렡盲렡렚}vB 1110101110100100100010111110100010000100101110011110110010101101100101111110110010000011100110001110110010010110101101001110101110100000101000011110011110011011101100101110101110100000101000011110101110100000100110100111110101110110111010111010010010001011111010001000010010111001111011001010110110010111111011001000001110011000111011001001011010110100111010111010000010100001111001111001101110110010111010111010000010100001111010111010000010011010011111010111011001000010 eba48be884b9ecad97ec8398ec96b4eba0a1e79bb2eba0a1eba09a7d76eba48be884b9ecad97ec8398ec96b4eba0a1e79bb2eba0a1eba09a7d7642
UHC 뤋脹쭗샘어렡盲렡렚}v뤋脹쭗샘어렡盲렡렚}vB 1000111110111011111100111110110010100111100011111011101111111001101111101110111010001110101100101101100011101110100011101011001010001110101011010111110101110110100011111011101111110011111011001010011110001111101110111111100110111110111011101000111010110010110110001110111010001110101100101000111010101101011111010111011001000010 8fbbf3eca78fbbf9beee8eb2d8ee8eb28ead7d768fbbf3eca78fbbf9beee8eb2d8ee8eb28ead7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)