To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壓??椅ゆ?幽??筌??悠??濡??愉 100110101101100000111111001111111000100011010110100000101110010000111111100101110100100000111111001111111110001010100011001111110011111110010111010010010011111100111111100101000100011100111111001111111001011011111001 9ad83f3f88d682e43f97483f3fe2a33f3f97493f3f94473f3f96f9
EUC-JP 壓??椅ゆ?幽??筌??悠??濡??愉 110101001101101000111111001111111011000011011000101001001110011000111111110011011010100100111111001111111110010010100101001111110011111111001101101010100011111100111111110001111010100000111111001111111100110011111011 d4da3f3fb0d8a4e63fcda93f3fe4a53f3fcdaa3f3fc7a83f3fccfb
UTF-8 壓쇰낄椅ゆ룚幽껊쭏筌먦룂悠뽫뙠濡뗫씮愉 111001011010001110010011111011001000011110110000111010111000001010000100111001101010010010000101111000111000001010000110111010111010001110011010111001011011100110111101111010101011101110001010111011001010110110001111111001111010110110001100111010111010100010100110111010111010001110000010111001101000001010100000111010111011110110101011111010111001100110100000111001101011111110100001111010111001011110101011111011001001010010101110111001101000010010001001 e5a393ec87b0eb8284e6a485e38286eba39ae5b9bdeabb8aecad8fe7ad8ceba8a6eba382e682a0ebbdabeb99a0e6bfa1eb97abec94aee68489
UHC 壓쇰낄椅ゆ룚幽껊쭏筌먦룂悠뽫뙠濡뗫씮愉 1110010011100010101111001110101110110011101001011110101111110101101010101110011010001111100101101110101011101011100000111110101110100111100010001110111110100111100100001110001110001111100000111110101011101101100101101110011110001100101001011110101110100001100010111110101110011101101111111110101011110000 e4e2bcebb3a5ebf5aae68f96eaeb83eba788efa790e38f83eaed96e78ca5eba18beb9dbfeaf0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
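
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not this page's actual implementation: it assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` codecs correspond to the charsets listed above, and that characters a charset cannot represent are replaced with `?` (0x3f), as the `3f` bytes in the table suggest. The names `CHARSETS`, `encode_row` and `convert` are hypothetical.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset; returns {charset: (bits, hex)}."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("ゆ").items():
        print(name, bits, hexstr)
```

For ゆ, one of the characters in the table, this sketch gives e38286 in UTF-8 and 82e4 in SJIS-WIN, which agrees with the corresponding bytes in the rows above. In ISO-8859-1, which has no Japanese characters, the same input becomes the single byte 3f.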