To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 狎??歪??獰??[狎??歪??獰??[^ 111000001011111000111111001111111001100001100011001111110011111111100000110101100011111100111111010110111110000010111110001111110011111110011000011000110011111100111111111000001101011000111111001111110101101101011110 e0be3f3f98633f3fe0d63f3f5be0be3f3f98633f3fe0d63f3f5b5e
EUC-JP 狎??歪??獰??[狎??歪??獰??[^ 111000001100000000111111001111111100111111000100001111110011111111100000110110000011111100111111010110111110000011000000001111110011111111001111110001000011111100111111111000001101100000111111001111110101101101011110 e0c03f3fcfc43f3fe0d83f3f5be0c03f3fcfc43f3fe0d83f3f5b5e
UTF-8 狎볡푻歪긷넃獰뉔궞[狎볡푻歪긷넃獰뉔궞[^ 111001111000101110001110111010111011001110100001111011011001000110111011111001101010110110101010111010101011100010110111111010111000010010000011111001111000110110110000111010111000100110010100111010101011011010011110010110111110011110001011100011101110101110110011101000011110110110010001101110111110011010101101101010101110101010111000101101111110101110000100100000111110011110001101101100001110101110001001100101001110101010110110100111100101101101011110 e78b8eebb3a1ed91bbe6adaaeab8b7eb8483e78db0eb8994eab69e5be78b8eebb3a1ed91bbe6adaaeab8b7eb8483e78db0eb8994eab69e5b5e
UHC 狎볡푻歪긷넃獰뉔궞[狎볡푻歪긷넃獰뉔궞[^ 111001001110010010010011111001111011111010000111111010001110000010110001111001011000011010010011111001111011111010000111111010011000001010110001010110111110010011100100100100111110011110111110100001111110100011100000101100011110010110000110100100111110011110111110100001111110100110000010101100010101101101011110 e4e493e7be87e8e0b1e58693e7be87e982b15be4e493e7be87e8e0b1e58693e7be87e982b15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)