Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 溜삘뵗溜쒕졎栒뷀썒栒붾꽇溜삘뵗溜쒕졎栒뷀썒栒붾꽇B 11101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111101100100100101001010111101100101000011000111011100110101000001001001011101011101101111000000011101100100011011001001011100110101000001001001011101011101101101011111011101010101111011000011111101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111101100100100101001010111101100101000011000111011100110101000001001001011101011101101111000000011101100100011011001001011100110101000001001001011101011101101101011111011101010101111011000011101000010 efa78bec8298ebb597efa78bec9295eca18ee6a092ebb780ec8d92e6a092ebb6beeabd87efa78bec8298ebb597efa78bec9295eca18ee6a092ebb780ec8d92e6a092ebb6beeabd8742
UHC 溜삘뵗溜쒕졎栒뷀썒栒붾꽇溜삘뵗溜쒕졎栒뷀썒栒붾꽇B 11101010111111101011101111100010100101001001100111101010111111101001110011101011101000001011101111100010111000111001010011101101100110111000010111100010111000111001010011101011100001001001100111101010111111101011101111100010100101001001100111101010111111101001110011101011101000001011101111100010111000111001010011101101100110111000010111100010111000111001010011101011100001001001100101000010 eafebbe29499eafe9ceba0bbe2e394ed9b85e2e394eb8499eafebbe29499eafe9ceba0bbe2e394ed9b85e2e394eb849942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
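
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the helper name `to_bitstrings` is hypothetical, and the mapping from the charset labels in the table to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1, SJIS-WIN, and EUC-JP rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Each byte becomes eight binary digits, most significant bit first.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


for label, binary, hexadecimal in to_bitstrings("B"):
    print(label, binary, hexadecimal)
```

For the ASCII letter `B`, every charset listed here yields the same single byte, `01000010` (hex `42`), which matches the trailing `42` in each row of the table.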