To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 瓮??業??狎??[瓮??業??狎??[^ 111000010100010000111111001111111000101111000110001111110011111111100000101111100011111100111111010110111110000101000100001111110011111110001011110001100011111100111111111000001011111000111111001111110101101101011110 e1443f3f8bc63f3fe0be3f3f5be1443f3f8bc63f3fe0be3f3f5b5e
EUC-JP 瓮??業??狎??[瓮??業??狎??[^ 111000011010010100111111001111111011011011001000001111110011111111100000110000000011111100111111010110111110000110100101001111110011111110110110110010000011111100111111111000001100000000111111001111110101101101011110 e1a53f3fb6c83f3fe0c03f3f5be1a53f3fb6c83f3fe0c03f3f5b5e
UTF-8 瓮뗥넇業쇠톷狎쀦샎[瓮뗥넇業쇠톷狎쀦샎[^ 111001111001001110101110111010111001011110100101111010111000010010000111111001101010010110101101111011001000011110100000111011011000011010110111111001111000101110001110111011001000000010100110111011001000001110001110010110111110011110010011101011101110101110010111101001011110101110000100100001111110011010100101101011011110110010000111101000001110110110000110101101111110011110001011100011101110110010000000101001101110110010000011100011100101101101011110 e793aeeb97a5eb8487e6a5adec87a0ed86b7e78b8eec80a6ec838e5be793aeeb97a5eb8487e6a5adec87a0ed86b7e78b8eec80a6ec838e5b5e
UHC 瓮뗥넇業쇠톷狎쀦샎[瓮뗥넇業쇠톷狎쀦샎[^ 111010001011011110001011111001011000011010010111111001011111011010111100111010001011011110001011111001001110010010010111111001101001100010111100010110111110100010110111100010111110010110000110100101111110010111110110101111001110100010110111100010111110010011100100100101111110011010011000101111000101101101011110 e8b78be58697e5f6bce8b78be4e497e698bc5be8b78be58697e5f6bce8b78be4e497e698bc5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)