To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???曄??碍る????仰??媛????? 0011111100111111001111111001111001000000001111110011111110001010010101101000001011101001001111110011111100111111001111111000101111000010001111110011111110010101010100010011111100111111001111110011111100111111 3f3f3f9e403f3f8a5682e93f3f3f3f8bc23f3f95513f3f3f3f3f
EUC-JP ???曄??碍る????仰??媛??獒?? 00111111001111110011111111011011101000010011111100111111101100111011011110100100111010110011111100111111001111110011111110110110110001000011111100111111110010011011001000111111001111111000111111001011101110110011111100111111 3f3f3fdba13f3fb3b7a4eb3f3f3f3fb6c43f3fc9b23f3f8fcbbb3f3f
UTF-8 琉꿩캒曄먯첈碍る젲琉꿩캒仰뜰냲媛쇗㎧獒앸젗 111011111010011110001100111010101011111110101001111011001011101010010010111001101001101110000100111010111010100010101111111011001011001010001000111001111010001010001101111000111000001010001011111011001010000010110010111011111010011110001100111010101011111110101001111011001011101010010010111001001011101110110000111010111001110010110000111010111000001110110010111001011010101010011011111011001000011110010111111000111000111010100111111001111000110110010010111011001001010110111000111011001010000010010111 efa78ceabfa9ecba92e69b84eba8afecb288e7a28de3828beca0b2efa78ceabfa9ecba92e4bbb0eb9cb0eb83b2e5aa9bec8797e38ea7e78d92ec95b8eca097
UHC 琉꿩캒曄먯첈碍る젲琉꿩캒仰뜰냲媛쇗㎧獒앸젗 111010111010010010110010111001101010111110011011111001111010010110010000111011001010101010010101111001001111010010101010111010111010000010100110111010111010010010110010111001101010111110011011111001001110011010110110111000111000011010000010111010101011000010111100111001101010011110111101111010001010001110011101111010111010000010010011 eba4b2e6af9be7a590ecaa95e4f4aaeba0a6eba4b2e6af9be4e6b6e38682eab0bce6a7bde8a39deba093

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)