To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????t????i?????t????iB 0011111100111111001111110011111100111111011101000011111100111111001111110011111101101001001111110011111100111111001111110011111101110100001111110011111100111111001111110110100101000010 3f3f3f3f3f743f3f3f3f693f3f3f3f3f743f3f3f3f6942
SJIS-WIN ?????t????i?????t????iB 0011111100111111001111110011111100111111011101000011111100111111001111110011111101101001001111110011111100111111001111110011111101110100001111110011111100111111001111110110100101000010 3f3f3f3f3f743f3f3f3f693f3f3f3f3f743f3f3f3f6942
EUC-JP ?????t????i?????t????iB 0011111100111111001111110011111100111111011101000011111100111111001111110011111101101001001111110011111100111111001111110011111101110100001111110011111100111111001111110110100101000010 3f3f3f3f3f743f3f3f3f693f3f3f3f3f743f3f3f3f6942
UTF-8 혪챌혮천챕t혞횊혥짙i혪챌혮천챕t혞횊혥짙iB 1110110110011000101010101110110010110001100011001110110110011000101011101110110010110010100111001110110010110001100101010111010011101101100110001001111011101101100110101000101011101101100110001010010111101100101001111001100101101001111011011001100010101010111011001011000110001100111011011001100010101110111011001011001010011100111011001011000110010101011101001110110110011000100111101110110110011010100010101110110110011000101001011110110010100111100110010110100101000010 ed98aaecb18ced98aeecb29cecb19574ed989eed9a8aed98a5eca79969ed98aaecb18ced98aeecb29cecb19574ed989eed9a8aed98a5eca7996942
UHC 혪챌혮천챕t혞횊혥짙i혪챌혮천챕t혞횊혥짙iB 1100001010010010110000111010011111000010100101011100001110110101110000111010100101110100110000101000100011000011100010001100001010001101110000101010001101101001110000101001001011000011101001111100001010010101110000111011010111000011101010010111010011000010100010001100001110001000110000101000110111000010101000110110100101000010 c292c3a7c295c3b5c3a974c288c388c28dc2a369c292c3a7c295c3b5c3a974c288c388c28dc2a36942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)