What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????P??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101000000111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f503f3f3f3f3f3f3f
SJIS-WIN ?徐???③佃???堪??P?徐???③佃 00111111100011111001100100111111001111110011111110000111010000101001001011001111001111110011111100111111100010101010110000111111001111110101000000111111100011111001100100111111001111110011111110000111010000101001001011001111 3f8f993f3f3f874292cf3f3f3f8aac3f3f503f8f993f3f3f874292cf
EUC-JP ?徐????佃???堪??P?徐????佃 0011111110111101111110010011111100111111001111110011111111000100110100010011111100111111001111111011010010101110001111110011111101010000001111111011110111111001001111110011111100111111001111111100010011010001 3fbdf93f3f3f3fc4d13f3f3fb4ae3f3f503fbdf93f3f3f3fc4d1
UTF-8 렊徐렖롒뤏③佃첁찊쥙堪앉렻P렊徐렖롒뤏③佃 11101011101000001000101011100101101111101001000011101011101000001001011011101011101000011001001011101011101001001000111111100010100100011010001011100100101111011000001111101100101100101000000111101100101100001000101011101100101001011001100111100101101000001010101011101100100101011000100111101011101000001011101101010000111010111010000010001010111001011011111010010000111010111010000010010110111010111010000110010010111010111010010010001111111000101001000110100010111001001011110110000011 eba08ae5be90eba096eba192eba48fe291a2e4bd83ecb281ecb08aeca599e5a0aaec9589eba0bb50eba08ae5be90eba096eba192eba48fe291a2e4bd83
UHC 렊徐렖롒뤏③佃첁찊쥙堪앉렻P렊徐렖롒뤏③佃 1000111010100001110111111110111110001110101010111000111011010111100011111011111110101000111010011110111011101100101010101000111010101001100011101010001010001110110010101110110110111110110010011000111011000011010100001000111010100001110111111110111110001110101010111000111011010111100011111011111110101000111010011110111011101100 8ea1dfef8eab8ed78fbfa8e9eeecaa8ea98ea28ecaedbec98ec3508ea1dfef8eab8ed78fbfa8e9eeec

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
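
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC), the `CHARSETS` mapping and the `encode_table` helper are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the table appears to do; Python's codecs may differ from this page for some characters.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("徐P"):
        print(label, bits, hex_string)
```

For the input `徐P`, the UTF-8 row comes out as `e5be9050`, which matches the `e5be90` and `50` byte groups visible in the UTF-8 row of the table.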