To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??純??偃??堰??節?ギ鴉??節ら? 100111101111010000111111001111111000111110000011001111110011111110011000111011100011111100111111100010011000000100111111001111111001000011011111001111111000001101001101111010011110101100111111001111111001000011011111100000101110011100111111 9ef43f3f8f833f3f98ee3f3f89813f3f90df3f834de9eb3f3f90df82e73f
EUC-JP 橈??純??偃??堰??節?ギ鴉??節ら? 110111001111011000111111001111111011110111100011001111110011111111010000111100000011111100111111101100011110000100111111001111111100000011100001001111111010010110101110111100101110110100111111001111111100000011100001101001001110100100111111 dcf63f3fbde33f3fd0f03f3fb1e13f3fc0e13fa5aef2ed3f3fc0e1a4e93f
UTF-8 橈띲굠純띹춳偃뗰쉔堰듸슬節꿴ギ鴉싷슬節ら솦 111001101010100110001000111010111001110110110010111010101011010110100000111001111011010010010100111010111001110110111001111011001011011010110011111001011000000110000011111010111001011110110000111011001000100110010100111001011010000010110000111010111001001110111000111011001000101010101100111001111010111110000000111010101011111110110100111000111000001010101110111010011011010010001001111011001000101110110111111011001000101010101100111001111010111110000000111000111000001010001001111011001000011010100110 e6a988eb9db2eab5a0e7b494eb9db9ecb6b3e58183eb97b0ec8994e5a0b0eb93b8ec8aace7af80eabfb4e382aee9b489ec8bb7ec8aace7af80e38289ec86a6
UHC 橈띲굠純띹춳偃뗰쉔堰듸슬節꿴ギ鴉싷슬節ら솦 111010001111101010001101111000111000001010001000111000101110110110001101111010001010110110001111111001011110011110001011111011111011110110101000111001011110100010110101111011111011110110111101111011111011110110110010111010011010101110101110111001001011110010011010111011111011110110111101111011111011110110101010111010011001100110011111 e8fa8de38288e2ed8de8ad8fe5e78befbda8e5e8b5efbdbdefbdb2e9abaee4bc9aefbdbdefbdaae9999f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)