To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鈺?К嵬よ?鸚??鈺?К嵬よ?鸚??^ 1111101111000100001111111000010001001011100110111100101010000010111001100011111111101010010111110011111100111111111110111100010000111111100001000100101110011011110010101000001011100110001111111110101001011111001111110011111101011110 fbc43f844b9bca82e63fea5f3f3ffbc43f844b9bca82e63fea5f3f3f5e
EUC-JP 鈺?К嵬よ?鸚??鈺?К嵬よ?鸚??^ 10001111111000111101010100111111101001111010110011010110110011001010010011101000001111111111001111000000001111110011111110001111111000111101010100111111101001111010110011010110110011001010010011101000001111111111001111000000001111110011111101011110 8fe3d53fa7acd6cca4e83ff3c03f3f8fe3d53fa7acd6cca4e83ff3c03f3f5e
UTF-8 鈺싩К嵬よ춾鸚㎪뤃鈺싩К嵬よ춾鸚㎪뤃^ 1110100110001000101110101110110010001011101010011101000010011010111001011011010110101100111000111000001010001000111011001011011010111110111010011011100010011010111000111000111010101010111010111010010010000011111010011000100010111010111011001000101110101001110100001001101011100101101101011010110011100011100000101000100011101100101101101011111011101001101110001001101011100011100011101010101011101011101001001000001101011110 e988baec8ba9d09ae5b5ace38288ecb6bee9b89ae38eaaeba483e988baec8ba9d09ae5b5ace38288ecb6bee9b89ae38eaaeba4835e
UHC 鈺싩К嵬よ춾鸚㎪뤃鈺싩К嵬よ춾鸚㎪뤃^ 11101000101011011001101011100111101011001010110011101000111000111010101011101000101011011001101011100101101001001010011111100110100011111011010011101000101011011001101011100111101011001010110011101000111000111010101011101000101011011001101011100101101001001010011111100110100011111011010001011110 e8ad9ae7acace8e3aae8ad9ae5a4a7e68fb4e8ad9ae7acace8e3aae8ad9ae5a4a7e68fb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)