To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ???熬??撓??}???熬??撓??{^ 00111111001111110011111111100000100100100011111100111111100111011001101000111111001111110111110100111111001111110011111111100000100100100011111100111111100111011001101000111111001111110111101101011110 3f3f3fe0923f3f9d9a3f3f7d3f3f3fe0923f3f9d9a3f3f7b5e
EUC-JP ???熬??撓??}???熬??撓??{^ 00111111001111110011111111011111111100100011111100111111110110011111101000111111001111110111110100111111001111110011111111011111111100100011111100111111110110011111101000111111001111110111101101011110 3f3f3fdff23f3fd9fa3f3f7d3f3f3fdff23f3fd9fa3f3f7b5e
UTF-8 玲잙젗熬곷젨撓뉗짆}玲잙젗熬곷젨撓뉗짆{^ 111011111010011010101101111011001001111010011001111011001010000010010111111001111000011010101100111010101011001110110111111011001010000010101000111001101001001010010011111010111000100110010111111011001010011110000110011111011110111110100110101011011110110010011110100110011110110010100000100101111110011110000110101011001110101010110011101101111110110010100000101010001110011010010010100100111110101110001001100101111110110010100111100001100111101101011110 efa6adec9e99eca097e786aceab3b7eca0a8e69293eb8997eca7867defa6adec9e99eca097e786aceab3b7eca0a8e69293eb8997eca7867b5e
UHC 玲잙젗熬곷젨撓뉗짆}玲잙젗熬곷젨撓뉗짆{^ 111001111011111110011111111010111010000010010011111010001010001010000001111010111010000010100000111010001111010110000111111011001010001110010101011111011110011110111111100111111110101110100000100100111110100010100010100000011110101110100000101000001110100011110101100001111110110010100011100101010111101101011110 e7bf9feba093e8a281eba0a0e8f587eca3957de7bf9feba093e8a281eba0a0e8f587eca3957b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)