To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 耀??悅i????[耀??悅i????[^ 100101110111001100111111001111111111101010111101100000101000100100111111001111110011111100111111010110111001011101110011001111110011111111111010101111011000001010001001001111110011111100111111001111110101101101011110 97733f3ffabd82893f3f3f3f5b97733f3ffabd82893f3f3f3f5b5e
EUC-JP 耀???i????[耀???i????[^ 11001101110101000011111100111111001111111010001111101001001111110011111100111111001111110101101111001101110101000011111100111111001111111010001111101001001111110011111100111111001111110101101101011110 cdd43f3f3fa3e93f3f3f3f5bcdd43f3f3fa3e93f3f3f3f5b5e
UTF-8 耀띺뒔悅i쪓僚묊룭[耀띺뒔悅i쪓僚묊룭[^ 111010001000000010000000111010111001110110111010111010111001001010010100111001101000001010000101111011111011110110001001111011001010101010010011111011111010011010111011111010111010110010001010111010111010001110101101010110111110100010000000100000001110101110011101101110101110101110010010100101001110011010000010100001011110111110111101100010011110110010101010100100111110111110100110101110111110101110101100100010101110101110100011101011010101101101011110 e88080eb9dbaeb9294e68285efbd89ecaa93efa6bbebac8aeba3ad5be88080eb9dbaeb9294e68285efbd89ecaa93efa6bbebac8aeba3ad5b5e
UHC 耀띺뒔悅i쪓僚묊룭[耀띺뒔悅i쪓僚묊룭[^ 111010011010010110001101111010011000101010010001111001101110110110100011111010011010010110001101111010001110100010010001111001111000111110100011010110111110100110100101100011011110100110001010100100011110011011101101101000111110100110100101100011011110100011101000100100011110011110001111101000110101101101011110 e9a58de98a91e6eda3e9a58de8e891e78fa35be9a58de98a91e6eda3e9a58de8e891e78fa35b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
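
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the table's labels, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the `3f` bytes in the table rows above.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("[^"):
        print(label, bits, hex_string)
```

Because "[" and "^" are plain ASCII, every charset listed here encodes them as the same two bytes, `5b5e`, which is why each row of the table ends with that suffix.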