To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 厓??渦??鵝??}厓??渦??鵝??{^ 111110101000110100111111001111111000100101010001001111110011111111101010010000000011111100111111011111011111101010001101001111110011111110001001010100010011111100111111111010100100000000111111001111110111101101011110 fa8d3f3f89513f3fea403f3f7dfa8d3f3f89513f3fea403f3f7b5e
EUC-JP 厓??渦??鵝??}厓??渦??鵝??{^ 1000111110110100110001110011111100111111101100011011001000111111001111111111001110100001001111110011111101111101100011111011010011000111001111110011111110110001101100100011111100111111111100111010000100111111001111110111101101011110 8fb4c73f3fb1b23f3ff3a13f3f7d8fb4c73f3fb1b23f3ff3a13f3f7b5e
UTF-8 厓⒴끀渦겼겢鵝녻뇮}厓⒴끀渦겼겢鵝녻뇮{^ 111001011000111010010011111000101001001010110100111010111000000110000000111001101011100010100110111010101011001010111100111010101011001010100010111010011011010110011101111010111000010110111011111010111000011110101110011111011110010110001110100100111110001010010010101101001110101110000001100000001110011010111000101001101110101010110010101111001110101010110010101000101110100110110101100111011110101110000101101110111110101110000111101011100111101101011110 e58e93e292b4eb8180e6b8a6eab2bceab2a2e9b59deb85bbeb87ae7de58e93e292b4eb8180e6b8a6eab2bceab2a2e9b59deb85bbeb87ae7b5e
UHC 厓⒴끀渦겼겢鵝녻뇮}厓⒴끀渦겼겢鵝녻뇮{^ 111001001110110110101001111001011000010110110110111010001011111010110000111001011000000110110100111001001011110110000110111010001000011110010011011111011110010011101101101010011110010110000101101101101110100010111110101100001110010110000001101101001110010010111101100001101110100010000111100100110111101101011110 e4eda9e585b6e8beb0e581b4e4bd86e887937de4eda9e585b6e8beb0e581b4e4bd86e887937b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)