To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 攸???臟?????[攸???臟?????[^ 100111011011111100111111001111110011111111100100011001100011111100111111001111110011111100111111010110111001110110111111001111110011111100111111111001000110011000111111001111110011111100111111001111110101101101011110 9dbf3f3f3fe4663f3f3f3f3f5b9dbf3f3f3fe4663f3f3f3f3f5b5e
EUC-JP 攸???臟?????[攸???臟?????[^ 110110101100000100111111001111110011111111100111110001110011111100111111001111110011111100111111010110111101101011000001001111110011111100111111111001111100011100111111001111110011111100111111001111110101101101011110 dac13f3f3fe7c73f3f3f3f3f5bdac13f3f3fe7c73f3f3f3f3f5b5e
UTF-8 攸편렩ㅽ臟狀뀄렩ㅾ놈[攸편렩ㅽ臟狀뀄렩ㅾ놈[^ 111001101001010010111000111011011000111010111000111010111010000010101001111000111000010110111101111010001000011110011111111011111010011110111010111010111000000010000100111010111010000010101001111000111000010110111110111010111000011010001000010110111110011010010100101110001110110110001110101110001110101110100000101010011110001110000101101111011110100010000111100111111110111110100111101110101110101110000000100001001110101110100000101010011110001110000101101111101110101110000110100010000101101101011110 e694b8ed8eb8eba0a9e385bde8879fefa7baeb8084eba0a9e385beeb86885be694b8ed8eb8eba0a9e385bde8879fefa7baeb8084eba0a9e385beeb86885b5e
UHC 攸편렩ㅽ臟狀뀄렩ㅾ놈[攸편렩ㅽ臟狀뀄렩ㅾ놈[^ 11101010111100101100011011101101100011101011011110100100111011011110110111110100111011011110111010110010111011011000111010110111101001001110111010110011111100000101101111101010111100101100011011101101100011101011011110100100111011011110110111110100111011011110111010110010111011011000111010110111101001001110111010110011111100000101101101011110 eaf2c6ed8eb7a4ededf4edeeb2ed8eb7a4eeb3f05beaf2c6ed8eb7a4ededf4edeeb2ed8eb7a4eeb3f05b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)